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■ Abstract: These lectures cover the basics of inflationary models for the early universe, 

concentrating particularly on the generation of density fluctuations from scalar-field dy- 
namics. The subsequent gravitational dynamics of these fluctuations in dark matter in a 
• Friedmann model are described, leading to a review of the current situation in confronting 

inflationary models with the latest data on the clustering of galaxies and other measures 
of large-scale structure. 
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1 General arguments for inflation 

1.1 The problems of classical cosmology 

The standard isotropic cosmology is a very successful framework for interpreting 
observations, but there are certain questions which, prior to the early 1980s, had to 
be avoided. These are encapsulated in a set of classical 'problems', as follows. 



The horizon problem 

Standard cosmology contains a particle horizon of comoving radius 
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which converges because R oc t 1 ^ 2 in the early radiation-dominated phase. At late 
times, the integral is dominated by the matter-dominated phase, for which 

6000 i 

-D H = Rq r H ~ — -j= h Mpc. (2) 



The horizon at last scattering was thus only ~ 100 Mpc in size, subtending an angle 
of about 1 degree. Why then are the large number of causally disconnected regions 
we see on the microwave sky all at the same temperature? 
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The flatness problem 



The Q = 1 universe is unstable: 



[1 - l/Q{z)\ = f(z) [1-1/rt)], 



(3) 



where f(z) = (1 + z)~ x in the matter-dominated era, oc (1 + z)~ 2 for radiation 
domination, so that f(z) ~ (1 + z eq )/(l + z) 2 at early times. To get J? ~ 1 now 
requires a fine tuning of Q in the past, which becomes more and more precisely 
constrained as we increase the redshift at which the initial conditions are supposed to 
have been imposed. Ignoring annihilation effects, 1 + z = T init /2.7 K; 1 + z eq ~ 10 4 , 
and so the required fine-tuning is 



At the Planck epoch, which is the natural initial time, this requires a deviation 
of only 1 part in 10 60 . This is satisfied if Q = 1 exactly, but a mechanism is still 
required to set up such an initial state. This equation is especially puzzling if Q ^ 1 
today: how could the universe 'know' to start with a deviation from Q = 1 just 
so tuned that the curvature starts to become important only now after so many 
e-foldings of the expansion? 

The antimatter problem 

At kT > m p c 2 , there exist in equilibrium roughly equal numbers of photons, pro- 
tons and antiprotons. Today, N p /N 1 ~ 10 -9 , but Np ~ 0. Conservation of baryon 
number would imply that N p /N p = 1 + O(10 -9 ) at early times Where did this initial 
asymmetry come from? 

The structure problem 

The Universe is not precisely homogeneous. We generally presume that galaxies and 
clusters grew via gravitational instability from some initial perturbations. What is 
the origin of these? 

This list can be extended to problems which come closer to astrophysics than 
cosmology per se. There is the question of the dark matter and its composition, 
for example. However, the above list encompasses problems which go back to the 
initial conditions of the Big Bang. It seems clear that conventional cosmological 
models need to be set up in an extremely special configuration, and this is certainly 
a deficiency of the theory. Critics can point out with some force that the big bang 
model explains nothing about the origin of the universe as we now perceive it, 
because all the most important features are 'predestined' by virtue of being built 
into the assumed initial conditions near to t = 0. 



^(t init ) - 1| < 10" 22 [£ init /GeV] 2 . 



(4) 
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The expansion problem 

Even the most obvious fact of the cosmo logical expansion is unexplained. Although 
general relativity forbids a static universe, this is not enough to understand the ex- 
pansion. The gravitational dynamics of R(t) are just those of a cannonball travelling 
vertically in the Earth's gravity. Suppose we see a cannonball rising at a given time 
t = to'- it may be true to say that it has r = ro and v = vq at this time because at a 
time At earlier it had r = r — voAt and v = vq — gAt, but it is hardly a satisfying 
explanation for the motion of a cannonball which was fired by a cannon. Neverthe- 
less, this is the only level of explanation that classical cosmology offers: the universe 
expands now because it did so in the past. Although it is not usually included in 
the list, one might thus with justice add an 'expansion problem' as perhaps the 
most fundamental of the catalogue of classical cosmological problems. Certainly, 
early generations of cosmologists were convinced that some specific mechanism was 
required in order to explain how the universe was set in motion. 

For many years, it was assumed that any solution to these difficulties would have 
to await a theory of quantum gravity. The classical singularity can be approached 
no closer than the Planck time of ~ 10~ 43 s, and so the initial conditions for the 
classical evolution following this time must have emerged from behind the presently 
impenetrable barrier of the quantum gravity epoch. There remains a significant 
possibility that this policy of blaming everything on quantum gravity may be correct, 
and this is what lies behind modern developments in quantum cosmology. This 
field is not really suitable for treatment at the present level, and it deals with the 
subtlest possible questions concerning the meaning of the wave function for the entire 
universe. The eventual aim is to understand if there could be a way in which the 
universe could be spontaneously created as a quantum-mechanical fluctuation, and 
if so whether it would have the initial properties which observations seem to require. 
Since this programme has to face up to the challenge of quantizing gravity, it is fair 
to say that there are as yet no definitive answers, despite much thought-provoking 
work. See chapter 11 of Kolb & Turner (1990) for an introduction. 

However, the great development of cosmology in the 1980s was the realization 
that the explanation of the initial-condition puzzles might involve physics at lower 
energies: 'only' 10 15 GeV. Although this idea, now known as inflation, cannot be 
considered to be firmly established, the ability to treat gravity classically puts the 
discussion on a much less speculative foundation. What has emerged is a general 
picture of the early universe of compelling simplicity, which moreover may be subject 
to observational verification. What follows is an outline of the main features of 
inflation; for more details see e.g. chapter 8 of Kolb & Turner (1990); Brandenberger 
(1990); Liddle k Lyth (1993). 

1.2 An overview of inflation 
Equation of state for inflation 

The list of problems with conventional cosmology provides a strong hint that the 
equation of state of the universe may have been modified at very early times. To 
solve the horizon problem and allow causal contact over the whole of the region 
observed at last scattering requires a universe that expands 'faster than light' near 
t = 0: R ex t a ; a > 1. If such a phase existed, the integral for the comoving horizon 
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would have diverged, and there would be no difficulty in understanding the overall 
homogeneity of the universe - this could then be established by causal processes. 
Indeed, it is tempting to assert that the observed homogeneity proves that such 
causal contact must once have occurred. This phase of accelerated expansion is the 
most general feature of what has become known as the inflationary universe. 

What condition does this place of the equation of state? In the integral for r H , 
we can replace dt by dR/R, which the Friedmann equation says is oc dR/^/pR 2 
at early times. Thus, the horizon diverges provided the equation of state is such 
that pR 2 vanishes or is finite as R — > 0. For a perfect fluid with p = (r — l)e as 
the relation between pressure and energy density, we have the adiabatic dependence 
p oc R~ 3F , and the same dependence for p if the rest- mass density is negligible. A 
period of inflation therefore needs 



r < 2/3 =>- pc 2 + 3p < 0. 



(5) 



An alternative way of seeing that this criterion is sensible is that the 'active mass 
density' p + 3p/c 2 then vanishes. Since this quantity forms the rhs of Poisson's 
equation generalized to relativistic fluids, it is no surprise that the vanishing of 
p + 3p/c 2 allows a coasting solution with R oc t. 

Such a criterion can also solve the flatness problem. Consider the Friedmann 
equation: 

# = 5^ - h». (6) 

As we have seen, the density term on the rhs must exceed the curvature term by a 
factor of at least 10 60 at the Planck time, and yet a more natural initial condition 
might be to have the matter and curvature terms being of comparable order of 
magnitude. However, an inflationary phase in which pR 2 increases as the universe 
expands can clearly make the curvature term relatively as small as required, provided 
inflation persists for sufficiently long. 



De sitter space and inflation 

We have seen that inflation will require an equation of state with negative pressure, 
and the only familiar example of this is the p = —pc 2 relation which applies for 
vacuum energy - in other words we are led to consider inflation as happening in a 
universe dominated by a cosmo logical constant. As usual, any initial expansion will 
redshift away matter and radiation contributions to the density, leading to increasing 
dominance by the vacuum term. If the radiation and vacuum densities are initially of 
comparable magnitude, we quickly reach a state where the vacuum term dominates. 
The Friedmann equation in the vacuum-dominated case has three solutions: 

R oc sinhHt k = — 1 

oc cosh Ht k = +l (7) 
oc exp Ht k = 0, 
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where H = ^Ac 2 /3 = A/87rG/9 vac /3. Thus, all solutions evolve towards the expo- 
nential k = solution, known as de Sitter space. Note that H is not the Hubble 
parameter at an arbitrary time (unless k = 0), but it becomes so exponentially fast 
as the hyperbolic trigonometric functions tend to the exponential. 

Because de Sitter space clearly has H 2 and p in the right ratio for Q = 1 
(obvious, since k = 0), the density parameter in all models tends to unity as the 
Hubble parameter tends to H. If we assume that the initial conditions are not 
fine-tuned (i.e. Q = 0(1) initially), then maintaining the expansion for a factor / 
produces 



n = i + o(r 2 )- 



(8) 



This can solve the flatness problem, provided / is large enough. To obtain Q of 
order unity today requires \Q — 1| < 10~ 52 at the GUT epoch, and so 



ln/>60 



(9) 



e-foldings of expansion are needed; it will be proved below that this is also exactly 
the number needed to solve the horizon problem. It then seems almost inevitable 
that the process should go to completion and yield Q = 1 to measurable accuracy 
today. There is only a rather small range of e-foldings (60 ± 2, say) around the 
critical value for which Q today can be of order unity without being effectively 
exactly unity, and it would constitute an unattractive fine-tuning to require that 
the expansion hit this narrow window exactly. 

This gives the first of two strong predictions of inflation: that the universe 
must be spatially flat 



inflation =^ k = 0. 



(10) 



Note that this need not mean the Einstein-de Sitter model; the alternative possibility 
is that a vacuum contribution is significant in addition to matter, so that f2 m + f2 v = 
1. Astrophysical difficulties in finding evidence for Q m = 1 are thus one of the major 
motivations, through inflation, for taking the idea of a large cosmological constant 
seriously. 



Reheating from inflation 

The discussion so far indicates a possible route to solving the problems with initial 
conditions in conventional cosmology, but has a critical missing ingredient. The 
idea of inflation is to set the universe expanding towards an effective k = state 
by using the repulsive gravitational force of vacuum energy or some other unknown 
state of matter which satisfies p < —pc 2 /3. There remains the difficulty of returning 
to a normal equation of state: the universe is required to undergo a cosmological 
phase transition. Such a suggestion would have seemed highly ad hoc. in the 1960s 
when the horizon and flatness problems were first clearly articulated by Dicke. The 
invention of inflation by Guth (1981) had to await developments in quantum field 
theory which provided a plausible basis for this phase transition. This mechanism 
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will be described below; it has been deliberately put off so far to emphasise the 
general character of many of the arguments for inflation. 

What will emerge is that it is possible for inflation to erase its tracks in a very 
neat way. If we are dealing with quantum fields at a temperature T, then an energy 
density ~ T 4 in natural units is expected in the form of vacuum energy. The vacuum- 
driven expansion produces a universe which is essentially devoid of normal matter 
and radiation; these are all redshifted away by the expansion, and so the temperature 
of the universe becomes < T. A phase transition to a state of zero vacuum energy, 
if instantaneous, would transfer the energy T 4 to normal matter and radiation as a 
latent heat. The universe would therefore be reheated: it returns to the temperature 
T at which inflation was initiated, but with the correct special initial conditions for 
the expansion. The transition in practical models is not instantaneous, however, and 
so the reheating temperature is lower than the temperature prior to inflation. 



Quantum fluctuations 

Note that de Sitter space contains an event horizon, in that the comoving distance 
that particles can travel between a time to and t = oo is finite 

(do not confuse this with the particle horizon, where the upper limit for the integral 
would be to). With R oc expHt, the proper radius of the horizon is Ror EH = c/H. 
Figure 1 illustrates the situation. The exponential expansion literally makes distant 
regions of space move faster than light, so that points separated by > c/H can never 
communicate with each other. 

As with black holes, it therefore follows that thermal Hawking radiation will 
be created. These quantum fluctuations in de Sitter spacetime provide the seeds for 
what will eventually become galaxies and clusters. The second main prediction of 
inflation is that such fluctuations should exist in all fields, in particular that there 
should exist a background of gravitational waves left as a relic of inflation. 

This idea of obtaining all structure in the universe (including ourselves) from 
quantum fluctuations is a magical idea of tremendous appeal. If it could be shown 
to be correct, it would rank as one of the greatest possible intellectual advances. We 
now have to look at some of the practical details to see how this concept might be 
made to function in practice, and how it may be tested. 



2 Inflation field dynamics 



The general concept of inflation rests on being able to achieve a negative-pressure 
equation of state. This can be realised in a natural way by quantum fields in the 
early universe. In order to understand what is going on, it is necessary to summarize 
some of the most important concepts and jargon from this field. 
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Fig. 1. The event horizon in de Sitter space. Particles outside the sphere at r = c/H 
can never receive light signals from the origin, not can an observer at the origin receive 
information from outside the sphere. The exponential expansion quickly accelerates any 
freely-falling observers to the point where their recession from the origin is effectively 
superluminal. 

2.1 Quantum fields and potentials 
Lagrangians and fields 

Consider the formulation of classical mechanics in terms of an action principle. 
We write the variational equation 



(12) 



where the Lagrangian L is the difference of the kinetic and potential energies, 
L = T — V, and the integral J L dt is the action. What this says is that, for particles 
described by coordinates qi(t), the path travelled by each particle between given 
starting and finishing positions is such that the action is extremal (not necessarily a 
minimum, even though one often speaks of the principle of least action) with respect 
to small variations in the paths. If we take the 'positions' qi(t) and 'velocities' qi{t) 
to be independent variables, then expansion of L in terms of small variations in the 
paths and integration by parts leads to Euler's equation for each particle 

d_ (dL\ _ dL_ 
dt \dqi ) dqi 

A field may be regarded as a dynamical system, but with an infinite number of 
coordinates, qi, which are the field values at each point in space. How do we handle 
this? A hint is provided by electromagnetism, where we are familiar with writing 




(13) 
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the total energy in terms of a density which, as we are dealing with generalized me- 
chanics, we may formally call the Hamiltonian density. This suggests that we write 
the Lagrangian in terms of a Lagrangian density C: L = J £ dV. This quantity 
is of such central importance in quantum field theory, that it is usually referred to 
(incorrectly) simply as 'the Lagrangian'. In these terms, our action principle now 
takes the pleasingly relativistic form 

sjc** = 0, (14) 

although note that to be correct in general relativity, the Lagrangian needs to take 
the form of a invariant scalar times the Jacobian y/—g. 

We can now apply the variational principle as before, considering £ to be a 
function of a general coordinate, <p which is the field, and the '4-velocity' d^(p. This 
yields the Euler-Lagrange equation 
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( dC \ dC 


Q xi x 


\d{d^)J ~ df 



(15) 



A Lagrangian then immediately gives a wave equation, as the following examples 
illustrate. The forms of most Lagrangians are often quite similar: if we want a wave 
equation linear in second derivatives of the field, then the Lagrangian must contain 
terms quadratic in first derivatives of the field. 

(1) Waves on a string. A good classical example of the Euler-Lagrange formalism 
is provided by waves on a string. If the density per unit length is cr, the tension is T, 
and we call the transverse displacement of the string y, then the (one-dimensional) 
Lagrangian density is 

C = \af - \Ty" (16) 

(at least for small displacements) . The potential term comes from the work done in 
stretching the string. Inserting this in the Euler-Lagrange equation yields a familiar 
result: 

ay-Ty" = 0. (17) 

This is just the wave equation, and it tells us that the speed of sound on a plucked 
string is jo. 

(2) Complex scalar field. Here we want a Lagrangian which will yield the Klein- 
Gordon equation (□ + (i 2 )(j) = 0. The required form is 

C={d^){d^Y (18) 

where the only subtlety is that phi and <fi* are treated as independent variables 
(rather than the real and imaginary parts of 0). For a real field, the Lagrangian 
becomes 



(19) 
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Noether's theorem 

The existence of global symmetries of the Lagrangian is closely connected with 
conservation laws in physics. In classical mechanics, conservation of energy and 
momentum arise by considering Euler's equation 



d f dL 



dt V dxj 



dL 

dxi 



0. 



(20) 



If L is independent of position, then we obtain conservation of momentum (or 
angular momentum, if x is an angular coordinate): pi = dL/dii = constant. If 
L has no explicit dependence on t, then 



dL (dL dL \ ^ 



(21) 



which leads us to define the Hamiltonian as a further constant of the motion 

H = ^PiQi — L = constant. (22) 

Something rather similar happens in the case of quantum (or classical) field 
theory: the existence of a global symmetry leads directly to a conservation law. The 
difference between discrete dynamics and field dynamics where the Lagrangian is a 
density is that the result is expressed as a conserved current rather than a simple 
constant of the motion. In what follows, we symbolize the field by 0, but this is not 
to imply that there is any restriction to scalar fields. If several fields are involved 
(e.g. <p and <p* for a complex scalar field), they should be summed over. 

Suppose the Lagrangian is independent of explicit dependence on spacetime (i.e. 
depends on x M only implicitly through the fields and their 4-derivatives). The algebra 
is similar to that above, and we obtain 



d 

dx v 



dC d(f> 



d(d"(/>) dx» 



- C9nv 



(23) 



This has produced a conserved tensor: the term in square brackets is to be identified 
with the energy-momentum tensor of the field, T^ v . 

Natural units 

To simplify the appearance of equations, it is universal practice in quantum field 
theory to adopt natural units where we take 



tl = C = Ho = €q = 1- 



(24) 



This convention makes the meaning of equations clearer by reducing the algebraic 
clutter, and is also useful in the construction of intuitive arguments for the order of 
magnitude of quantities in field theory. 
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The adoption of natural units corresponds to fixing the units of charge, mass, 
length and time relative to each other. This leaves one free unit, usually taken to 
be energy. Natural units are thus one step short of the Planck system, in which 
G = 1 also, so that all units are fixed and all physical quantities are dimensionless. 
In natural units, the following dimensional equalities hold: 

[E] = [m] 
[L] = [m]" 1 

Hence, the dimensions of energy density are 

[£] = H 4 , (26) 

with units usually quoted in GeV 4 . Thus, when we deal with quadratic derivative 
terms d^^d^cj) an d quadratic mass terms (m 2 (/> 2 , m 2 A^A^/2 etc.), the dimensions 
of the fields must clearly be 

[0] = [A"] = [m\. (27) 



(25) 



2.2 Equations of motion 
Quantum fields at high temperatures 

The critical fact we shall need from quantum field theory is that quantum fields can 
produce an energy density which mimics a cosmological constant. The discussion will 
be restricted to the case of a scalar field </> (complex in general, but often illustrated 
using the case of a single real field) . The restriction to scalar fields is not simply for 
reasons of simplicity, but because the scalar sector of particle physics is relatively 
unexplored. While vector fields such as electromagnetism are well understood, it is 
expected in many theories of unification that additional scalar fields such as the 
Higgs field will exist. We now need to look at what these can do for cosmology. 

The Lagrangian density for a scalar field is as usual of the form of a kinetic 
minus a potential term: 

C = \d^d^-V{cj ) ). (28) 
In familiar examples of quantum fields, the potential would be 

V((j>) =m 2 <p 2 /2, (29) 

where m is the mass of the field in natural units. However, it will be better to keep 
the potential function general at this stage. As usual, Noether's theorem gives the 
energy-momentum tensor for the field as 

= d»4>d v 4> - g^C. (30) 

From this, we can read off the energy density and pressure: 

p=l0 2 + y(0) + I(V0) 2 

p= l02_ y(0) _ 1 (V0) 2 



(31) 
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If the field is constant both spatially and temporally, the equation of state is then 
p = —p, as required if the scalar field is to act as a cosmological constant; note that 
derivatives of the field spoil this identification. 

If is a (complex) Higgs field, then the symmetry-breaking Mexican hat potential 
might be assumed: 



At the classical level, such potentials determine where \<p\ will be found in equilib- 
rium: at the potential minimum. In quantum terms, this goes over to the vacuum 
expectation value (O|0|O). However, these potentials do not include the inevitable 
fluctuations which will arise in thermal equilibrium. We know how to treat these in 
classical systems: at non-zero temperature a system of fixed volume will minimize 
not its potential energy, but the Helmholtz free energy F = V — TS, S being 
the entropy. The calculation of the entropy is technically complex, since it involves 
allowance for the quantum interactions with a thermal bath of background particles. 
However, the main result can be justified, as follows. The effect of the thermal inter- 
action must be to add an interaction term to the Lagrangian Ci n t(<f>, V0> where ip is a 
thermally-fluctuating field which corresponds to the heat bath. In general, we would 
expect dnt to have a quadratic dependence on \<p\ around the origin: jC[ n t oc \(f>\ 
(otherwise we would need to explain why the second derivative either vanished or 
diverged); the coefficient of proportionality will be an effective mass 2 that depends 
on the thermal fluctuations in ip. On dimensional grounds, this coefficient must be 
proportional to T 2 , although a more detailed analysis would be required to get the 
constant of proportionality. 

There is thus a temperature-dependent effective potential which we have to 
minimize: 



The effect of this on the symmetry-breaking potential is dramatic, as illustrated 
in Figure 2. At very high temperatures, the potential will be parabolic, with a 



and the symmetry is broken. In between, there must be three critical temperatures: 
at Ti, a second minimum appears in V e g at \<p\ ^ 0, and this will be the global 
minimum for some Ti <T\. For T < T2, the state at |0| = is known as the false 
vacuum, whereas the global minimum is known as the true vacuum. Finally, at 
T = T 3 , the curvature around the origin changes sign and there is only one minimum 
in the potential. The universe can no longer be trapped in the false vacuum and can 
make a first-order phase transition to the true vacuum state. 

The crucial point to note for cosmology is that there is an energy- density differ- 
ence between the two vacuum states: 



y(</>) = VH 2 + A|0| 4 . 



(32) 




(33) 



minimum at \(/)\ = 0, whereas 





(34) 



If we say that the zero of energy is such that V = in the true vacuum, this implies 
that the false- vacuum symmetric state displayed an effective cosmological constant. 
On dimensional grounds, this must be an energy density ~ m 4 in natural units, 
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Fig. 2. The temperature-dependent effective potential illustrated at several temperatures. 
For T > Ti, only the false vacuum is available; for T < T2 the true vacuum is energetically 
favoured and the potential approaches the zero-temperature form. For T < T3 the true 
vacuum is the only minimum. 



where m is the energy at which the phase transition occurs. For GUTs, m ~ 10 15 
GeV; in laboratory units, this implies 

Pvac= " 3 ! ^10 8 °kgm- 3 . (35) 

The inevitability of such a colossal vacuum energy in models with GUT-scale sym- 
metry breaking was the major motivation for the concept of inflation as originally 
envisaged by Guth (1981). At first sight, the overall package looks highly appeal- 
ing, since the phase transition from false to true vacuum both terminates inflation 
and also reheats the universe to the GUT temperature, allowing the possibility that 
GUT-based reactions which violate baryon-number conservation can generate the 
observed matter/antimatter asymmetry. 

However, while a workable inflationary cosmology will very likely deploy these 
basic elements of vacuum-driven expansion, fluctuation generation, and reheating, 
it has become clear that such a model must be more complex than Guth's initial 
proposal. To explain where the problems arise, we need to look in more detail at 
the functioning of the inflation mechanism. 



Dynamics of the inflation field 

Treating the field classically (i.e. considering the expectation value 
energy-momentum conservation (T^ u = 0) the equation of motion 



) , we get from 



+ 3if0 - V 2 (f) + dV/d(j) = 0. 



(36) 
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This can also be derived more easily by the direct route of writing down the action 
S = J £ \/—g d^x and applying the Euler-Lagrange equation which arises from a 
stationary action (y/—g = R 3 (t) for an FRW model, and this is the origin of the 
Hubble drag 3H(j) term). 

The solution of the equation of motion becomes tractable if we both ignore spatial 
inhomogeneities in 4> and make the slow-rolling approximation that |0| <C \3H<fi\, 
\dV/d(p\. Both these steps are required in order that inflation can happen; we have 
shown above that the vacuum equation of state only holds if in some sense cf> changes 
slowly both spatially and temporally. Suppose there are characteristic temporal and 
spatial scales T and X for the scalar field; the conditions for inflation are that the 
negative-pressure equation of state from V(4>) must dominate the normal-pressure 
effects of time and space derivatives: 



V > 2 /T 2 , V > <p 2 /X 2 , 



(37) 



hence \dV/d(f)\ ~ V/<f> must be > (j)/T 2 



The <p term can therefore be neglected 



in the equation of motion, which then takes the slow-rolling form 



3H<j) = -dV/d<j). 



(38) 



The conditions for inflation can be cast into useful dimensionless forms. The basic 
7»f condition can now be rewritten using the slow-roll relation as 




(39) 



Also, we can differentiate this expression to obtain the criterion V" <C V'/m P 
Using slow-roll once more gives 3H^>/m P for the rhs, which is in turn <C 3H^/V /m r 
because 4> 2 <C V, giving finally 



V = (V"/V) « 1 



(40) 



(recall that for de Sitter space H = y/8nGV((p)/3 ~ \fV /m P in natural units). 
These two criteria make perfect intuitive sense: the potential must be flat in the 
sense of having small derivatives if the field is to roll slowly enough for inflation to 
be possible. 

Similar arguments can be made for the spatial parts. However, they are less 
critical: what matters is the value of V0 = V CO moving <p/R- Since R increases ex- 
ponentially, these perturbations are damped away: assuming V is large enough for 
inflation to start in the first place, inhomogeneities rapidly become negligible. This 
'stretching' of field gradients as we increase the cosmo logical horizon beyond the 
value predicted in classical cosmology also solves a related problem which was his- 
torically important in motivating the invention of inflation - the monopole prob- 
lem. Monopoles are point-like topological defects which would be expected to arise 
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in any phase transition at around the GUT scale (t ~ 10~ 35 s). If they form at 
approximately one per horizon volume at this time, then it follows that the present 
universe would contain Q ^> 1 in monopoles (see e.g. section 7.6 of Kolb & Turner 
1990). This unpleasant conclusion is avoided if the horizon can be made much larger 
than the classical one at the end of inflation; the GUT fields have then been aligned 
over a vast scale, so that topological defect formation becomes extremely rare. 



Ending inflation 

Although spatial derivatives of the scalar field can thus be neglected, the same 
is not always true for time derivatives. Although they may be negligible initially, 
the relative importance of time derivatives increases as <fi rolls down the potential 
and V approaches zero (leaving aside the subtle question of how we know that the 
minimum is indeed at zero energy). Even if the potential does not steepen, sooner 
or later we will have e ~ 1 or \rj\ ~ 1 and the inflationary phase will cease. Instead 
of rolling slowly 'downhill', the field will oscillate about the bottom of the potential, 
but with the oscillations becoming damped by the 3H<p friction term. Eventually, 
we will be left with a stationary field which either continues to inflate without end 
(if V(4> = 0) > 0) or which simply has zero density. This would be rather a boring 
universe to inhabit, but fortunately there is a more realistic way in which inflation 
can end. We have neglected so far the couplings of the scalar field to matter fields. 
Such couplings will cause the rapid oscillatory phase to produce particles, leading 
to reheating. Thus, even if the minimum of V(4>) is at V = 0, the universe is left 
containing roughly as much energy density as it started with, but now in the form 
of normal matter and radiation - which starts the usual FRW phase, albeit with 
the desired special 'initial' conditions. 

As well as being of interest for completing the picture of inflation, it is essential 
to realise that these closing stages of inflation are the only ones of observational 
relevance. Inflation might well continue for a huge number of e-foldings, all but the 
last few satisfying e,rj <C 1. However, the scales which left the de Sitter horizon at 
these early times are now vastly greater than our observable horizon, c/Hq, which 
exceeds the de Sitter horizon by only a finite factor. If inflation terminated by 
reheating to the GUT temperature, then the expansion factor to today is 

a~l T ~ E GVT /E 1 . (41) 

The comoving horizon size at the end of inflation was therefore 

d H (t GUT ) ~ a"^ T [c/H GUT ] ~ [£ P /£ 7 ] E-^, (42) 

where the last expression in natural units uses H ~ \/V /E P ~ E^, VT /E P . For a 
GUT energy of 10 15 GeV, this is about 10 m. We need enough e-foldings to have 
stretched this to the present horizon size 



(43) 
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By construction, this is enough to solve the horizon problem, and it is also the 
number of e-foldings needed to solve the flatness problem. This is no coincidence, 
since we saw earlier that the criterion in this case was 



Now, a eq = p-y/p, and p = 3H 2 f2/8nG. In natural units, this translates to p ~ 
E 2 [c/H }~ 2 , or a~q ~ E 2 [c/ H Q ]~ 2 / E^. The expression for N is then identical to 
the one for N in the case of the horizon problem: the same number of e-folds will 
always solve both. 

Realizing that the observational regime corresponds only to the terminal phases 
of inflation is both depressing and stimulating. Depressing, because 4> may well not 
move very much during the last phases: our observations relate only to a small piece 
of the potential, and we cannot hope to recover its form without substantial a priori 
knowledge. Stimulating because observations even on very large scales must relate 
to a period where the simple concepts of exponential inflation and scale-invariant 
density fluctuations were coming close to breaking down. This opens the possibility 
of testing inflation theories in a way that would not be possible with data relating 
to only the simpler early phases. These tests take the form of tilt and gravitational 
waves in the final perturbation spectrum, to be discussed further below. 

2.3 Inflationary models 
Early inflation models 

These general principles contrast sharply with Guth's initial idea, where the po- 
tential was trapped at (p = 0, and eventually underwent a first-order phase tran- 
sition. This model suffers from the problem that it predicts far too large residual 
inhomogeneities after inflation is over. This is easily seen: because the transition is 
first-order, it proceeds by bubble nucleation where the vacuum tunnels between 
false and true vacuum. However, the extent of these bubbles will spread as a causal 
process, whereas outside the bubbles the exponential expansion of inflation is con- 
tinuing. This means that it is very difficult for the bubbles to percolate and eliminate 
the false vacuum everywhere, as is needed for an end to inflation. Instead, inflation 
continues indefinitely, with the bubbles of true vacuum having only a small filling 
factor at any time. This graceful exit problem motivated variants in which the 
potential is flatter near the origin, so that the phase transition is second order and 
can proceed smoothly everywhere. 

However, there is also a more general problem with Guth's model and its variants. 
If the initial conditions are at a temperature T GUT , we expect thermal fluctuations in 
(f>; the potential should generally differ from its minimum by an amount V ~ T£ UT , 
which is of the order of the difference between true and false vacua. How then is the 
special case needed to trap the potential near = to arise? We have returned to 
the sort of fine-tuned initial conditions from which inflation was designed to save 
us. 

Combined with the difficulties in achieving small inhomogeneities after inflation 
is over, Guth's original inflation model thus turned out to have insuperable diffi- 
culties. However, for many cosmologists the main concepts of inflation have been 
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too attractive to give up. The price one pays for this is to decouple inflation from 
standard particle physics (taking the liberty of including GUTs in this category): 
inflation can in principle be driven by the vacuum energy of any scalar field. The 
ideological inflationist will then take the position that such a field (the inflaton) 
must have existed, and that it is our task to work out its properties from empirical 
cosmological evidence, rather than a priori particle-physics considerations. Barring 
a credible alternative way of understanding the peculiarities of the initial conditions 
of the big bang, there is much to be said for this point of view. 

Chaotic inflation models 

Most attention is currently paid to the more general models where the field finds 
itself some way from its potential minimum. This idea is termed chaotic inflation 
(see e.g. Linde 1989). The name originates because this class of models is also quite 
philosophically different from other inflation models; it does not require that there 
is a single Friedmann model containing an inflation-driving scalar field. Rather, 
there could be some primordial chaos, within which conditions might vary. Some 
parts may attain the conditions needed for inflation, in which case they will expand 
hugely, leaving a universe inside a single bubble - which we inhabit. In principle this 
bubble has an edge, but if inflation persists for sufficiently long, the distance to this 
nastiness is so much greater than the current particle horizon that its existence has 
no testable consequences. 

A wide range of inflation models of this kind is possible, illustrating the free- 
dom which arises once the parameters of the theory are constrained only by the 
requirement that inflation be produced. Things become even less constrained once 
it is realized that inflation need not correspond to de Sitter space, even though this 
was taken for granted in early discussions. As discussed earlier, it is only necessary 
that the universe enter a phase of 'superluminal' expansion in which the equation 
of state satisfies p < —pc 2 /3. 

For a pure static field, we will have the usual p = —pc 2 vacuum equation of state, 
and so a significant deviation from de Sitter space requires a large contribution from 
<p terms (although the slow-roll conditions will often still be satisfied). Intuitively, 
this corresponds to a potential which must be steep in some sense that is deter- 
mined by the desired time dependence of the scale factor. Three special cases are of 
particular interest: 

(i) Polynomial inflation. If the potential is taken to be V oc <p a , then the scale- 
factor behaviour is very close to exponential. This becomes less true as a in- 
creases, but investigations are usually limited to (p 2 and (p 4 potentials on the 
grounds that higher powers are nonrenormalizable. 

(ii) Power-law inflation. On the other hand, a(t) oc t p would suffice, provided 
p > 1. The potential required to produce this behaviour is 



(iii) Intermediate inflation. Another simple time-dependence which suffices for 
inflation is a(t) oc exp[(£/£o) ]• In the slow-roll approximation, the required 
potential here is V{4>) oc </> _/3 , where (3 = 4(/ _1 — 1). 




(45) 
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There are in addition a plethora of more specific models with various degrees of 
particle-physics motivation. Since at the time of writing none of these seem likely to 
become permanent fixtures, these will mostly not be described in detail. The above 
examples are more than enough to illustrate the wide range of choice available. 



Criteria for inflation 

Successful inflation in any of these models requires > 60 e-foldings of the expansion. 
The implications of this are easily calculated using the slow-roll equation, which 
gives the number of e-foldings between 0i and 4>2 as 



For any potential which is relatively smooth, V ~ V/4>, and so we get N ~ 
(0start/wp) 2 , assuming that inflation terminates at a value of (j> rather smaller than 
the start. The criterion for successful inflation is thus that the initial value of the 
field exceeds the Planck scale: 



By the same argument, it is easily seen that this is also the criterion needed to 
make the slow-roll parameters e and r\ be <C 1. To summarise, any model in which 
the potential is sufficiently flat that slow-roll inflation can commence, will probably 
achieve the critical 60 e-foldings. Counterexamples can of course be constructed, but 
they have to be somewhat special cases. 

It is interesting to review this conclusion for some of the specific inflation models 
listed above. Consider a mass-like potential V = m 2 (p 2 . If inflation starts near the 
Planck scale, the fluctuations in V are ~ m% and our condition becomes m <C m P ; 
similarly, for V = A<^ 4 , the condition is weak coupling: A < 1. Any field with a 
rather flat potential will thus tend to inflate, just because typical fluctuations leave 
it a long way from home in the form of the potential minimum. In a sense, inflation 
is realized by means of 'inertial confinement': there is nothing to prevent the scalar 
field from reaching the minimum of the potential - but it takes a long time to do 
so, and the universe has inflated by a large factor in the meantime. 

This requirement for weak coupling and/or small mass scales near the Planck 
epoch is suspicious, since quantum corrections will tend to re-introduce the Planck 
scale. In this sense, as with the appearance of the Planck scale as the minimum 
required field value, it is not clear that the aim of realizing inflation in a classical 
way distinct from quantum gravity has been fulfilled. 
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3 Relic fluctuations from inflation 



3.1 Motivation 



We have seen that de Sitter space contains a true event horizon, of proper size c/H. 
This suggests that there will be thermal fluctuations present, as with a black hole, 
for which the Hawking temperature is kT H = hc/(4nr s ). This analogy is close, 
but imperfect, and the characteristic temperature of de Sitter space is a factor of 2 
higher: 



kT d _ 
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The details of how these fluctuations translate into density perturbations after in- 
flation are somewhat technical. However, we can immediately note that a natural 
prediction will be a spectrum of perturbations which are scale invariant. This means 
that the metric fluctuations of spacetime receive equal levels of distortion from each 
decade of wavelength of perturbation, and may be quantified in terms of the fluctu- 
ations in Netwonian gravitational potential, ^ (c = 1): 




(49) 



The notation 6 n arises because the potential perturbation is of the same order as 
the density fluctuation on the scale of the horizon at any given time. 

It is commonly argued that the scale-invariant prediction arises because de Sitter 
space is invariant under time translation: there is no natural origin of time under 
exponential expansion. At a given time, the only length scale in the model is the 
horizon size c/H, so it is inevitable that the fluctuations which exist on this scale 
are the same at all times. After inflation ceases, the resulting fluctuations (constant 
amplitude on the scale of the horizon) give us the Zeldovich or scale-invariant 
spectrum. The problem with this argument is that it ignores the issue of how the 
perturbations evolve while they are outside the horizon; we have only really calcu- 
lated the amplitude for the last generation of fluctuations - i.e. those which are on 
the scale of the horizon at the time inflation ends. Fluctuations generated at ear- 
lier times will be inflated outside the de Sitter horizon, and will re-enter the FRW 
horizon at some time after inflation has ceased. 

The evolution during this period is a topic where some care is needed, since 
the description of these large-scale perturbations is sensitive to the gauge freedom 
in general relativity. A technical discussion is given in e.g. Mukhanov, Feldman & 
Brandenberger (1992), but there is no space to do this justice here. Rather, we 
shall rely on simply motivating the result, which is that potential perturbations re- 
enter the horizon with the same amplitude they had on leaving. This may be made 
reasonable in two ways. Perturbations outside the horizon are immune to causal 
effects, so it is hard to see how any large-scale non-flatness in spacetime could 
'know' whether it was supposed to grow or decline. More formally, we shall show 
below that small potential perturbations preserve their value, provided they are on 
scales where pressure effects can be neglected, and that this critical scale corresponds 
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to the horizon. We therefore argue that the inflationary process produces a universe 
which is fractal-like in the sense that scale-invariant fluctuations correspond to a 
metric which has the same 'wrinkliness' per log length-scale. It then suffices to 
calculate that amplitude on one scale - i.e. the smallest one where super-horizon 
evolution is not an issue. It is possible to alter this scale-invariant prediction only 
if the expansion is non-exponential; we have seen that such deviations plausibly do 
exist towards the end of inflation. 

To anticipate the detailed treatment, the inflationary prediction is of a horizon- 
scale amplitude 



(50) 



which can be understood as follows. Imagine that the main effect of fluctuations 
is to make different parts of the universe have fields which are perturbed by an 
amount 5(f). In other words, we are dealing with various copies of the same (j)(t) 
rolling behaviour, but viewed at different times 
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These universes will then finish inflation at different times, leading to a spread 
in energy densities. The horizon-scale density amplitude is given by the different 
amounts that the universes have expanded following the end of inflation: 



5 H ~ H 5t 



H 2 
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(52) 



where the last step uses the crucial input of quantum field theory, which is to say 
that the rms 5(f> = H/2tx. This result will be derived below, but it is immediately 
reasonable on dimensional grounds (in natural units, the field has the dimension of 
a temperature). 



3.2 The fluctuation spectrum 

We now need to go over this vital result in rather more detail. First, consider the 
equation of motion obeyed by perturbations in the inflaton field. The basic equation 
of motion is 

+ 3#0- V 2 (p + V'((f)) = 0, (53) 

and we seek the corresponding equation for the perturbation 5(f) obtained by start- 
ing inflation with slightly different values of (f> in different places. Suppose this per- 
turbation takes the form of a comoving plane-wave perturbation of amplitude A: 
5(f> = Aexpfzk ■ x — ikt/a]; the perturbed field 5(f) obeys the same equation of motion 
as the main field. If the slow-roll conditions are also assumed, so that V may be 
treated as a constant, we get 



[6<f>] + 3H[5(p] + (k/a) 2 5(p = 0, 



(54) 
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which is a standard wave equation for a massless field evolving in an expanding 
universe. 

Having seen that the inflaton perturbation behaves in this way, it is not much 
work to obtain the quantum fluctuations which result in the field at late times (i.e. 
on scales much larger than the de Sitter horizon) . First consider the fluctuations in 
flat space: the field would be expanded as 

4> k = uj k a k + Uk a h ( 55 ) 

and the field variance would be 

(0| \<p k \ 2 |0> = \u k \ 2 . (56) 

To solve the general problem, we only need to find how the amplitude u k changes 
as the universe expands. The idea is to start from the situation where we are well 
inside the horizon (k/a^> H), in which case flat-space quantum theory will apply, 
and end at the point of interest outside the horizon (where k/a <C H). 

Before finishing the calculation, note the critical assumption that the initial 
state is the vacuum: in the modes that will eventually be relevant for observational 
cosmology, we start with not even one quantum of the inflaton field present. Is 
this smuggling fine-tuning of the initial conditions back in through the back door? 
Given our ignorance of the exact conditions in the primordial chaos from which the 
inflationary phase is supposed to emerge, it is something of a matter of taste whether 
this is seen as being a problem. Certainly, if the initial state is close to equilibrium 
at temperature T, this is easily understood, since all initial scales are given in terms 
of T. In natural units, the energy density in V(4>) and radiation will be ~ T 4 , 
and the proper size of the horizon will be ~ T~ 2 . In the initial state, the inflaton 
occupation number will be 1/2 for very long wavelengths, and will fall for proper 
wavelengths < T _1 . Now, remember that T is in units of the Planck temperature, 
so that T ~ 10 -4 for GUT-scale inflation. That means that perturbations of scale 
smaller than T times the horizon would start with n ~ for a thermal state. 
However, this is only the initial state, and we expect that occupation number will 
be an adiabatic invariant which is constant for a given comoving wavelength. Thus, 
after ln(l/T) e-foldings of inflation (i.e. a few), every mode that remains inside the 
horizon will have the required zero occupation number. Of course, the motivation 
for a thermal initial state is weak, but the main point can be made in terms of 
energy density. If inflation is to happen at all, V(4>) must dominate, and it cannot 
do this if the inflaton fluctuations exist down to zero wavelength because the effective 
radiation density would then diverge. It thus seems reasonable to treat any initial 
state that inflates as rapidly entering a vacuum state. 

It is also worth noting in passing that these fluctuations in the scalar field can 
in principle affect the progress of inflation itself. They can be thought of as adding 
a random-walk element to the classical rolling of the scalar field down the trough 
defined by V(<p). In cases where <p is too close to the origin for inflation to persist 
for sufficiently long, it is possible for the quantum fluctuations to push (f> further 
out - creating further inflation in a self-sustaining process. This is the concept of 
stochastic inflation (Linde 1986, 1989). 

Returning now to the calculation, we want to know how the mode amplitude 
changes as the wavelength passes through the horizon. Initially, we have the (ex- 
panding) flat-space result 
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The powers of scale factor, a(t), just allow for expanding the field in comoving 
wavenumbers k. The field amplitude contains a normalizing factor of F -1 / 2 , V 
being a proper volume, hence the a -3 / 2 factor, if we use comoving V = 1. Another 
way of looking at this is that the proper number density of inflatons goes as a~ s as 
the universe expands. With this boundary condition, it straightforward to check by 
substitution that the following expression satisfies the evolution equation: 




(58) 



(remember that H is a constant, so that (d/dt)[aH] = Ha = aH 2 etc.). At early 
times, when the horizon is much larger than the wavelength, aH/k <C 1, and so Uk 
is the flat-space result, except that the time dependence looks a little odd, being 
exp[— ik/aH]. However, since (d/dt)[k/aH] = —k/a, we see that the oscillatory term 
has a leading dependence on t of the desired kt/a form. In the limit of very early 
times, the period of oscillation is <C -ff -1 , so a is effectively a constant from the 
point of view of the epoch where quantum fluctuations dominate. 

At the opposite extreme, aH/k ^> 1, the fluctuation amplitude becomes frozen 
out at the value 
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The initial quantum zero-point fluctuations in the field have been transcribed to 
a constant classical fluctuation which can eventually manifest itself as large-scale 
structure. The fluctuations in depend on k in such a way that the fluctuations per 
decade are constant: 
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This completes the argument. The rms value of fluctuations in <p can be used as 
above to deduce the power spectrum of mass fluctuations well after inflation is over. 
In terms of the variance per In k in potential perturbations, the answer is 
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(61) 



where we have written once again the exact relation between H and V and the 
slow-roll condition, since manipulation of these three equations is often required in 
derivations. 
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This result calls for a number of comments. First, if H and <j) are both constant 
then the predicted spectrum is exactly scale-invariant, with some characteristic in- 
homogeneity on the scale of the horizon. As we have seen, exact de Sitter space with 
constant H will not be strictly correct for most inflationary potentials; nevertheless, 
in most cases the main points of the analysis still go through. The fluctuations in 
4> start as normal flat-space fluctuations (and so not specific to de Sitter space), 
which change their character as they are advected beyond the horizon and become 
frozen-out classical fluctuations. All that matters is that the Hubble parameter is 
roughly constant for the few e-foldings that are required for this transition to hap- 
pen. If H does change with time, the number to use is the value at the time that a 
mode of given k crosses the horizon. Even if H were to be made precisely constant, 
there remains the dependence on </>, which again will change as different scales cross 
the horizon. This means that different inflationary models display different charac- 
teristic deviations from a nearly scale-invariant spectrum, and this is discussed in 
more detail below. Two other characteristics of the perturbations are more general: 
they should be Gaussian and adiabatic in nature. A Gaussian density field is one 
for which the joint probability distribution of the density at any given number of 
points is a multivariate Gaussian. The easiest way for this to arise in practice is 
for the density field to be constructed as a superposition of Fourier modes with 
independent random phases; the Gaussian property then follows from the Central 
Limit Theorem. It is easy to see in the case of inflation that this requirement will be 
satisfied: the quantum commutation relations only apply to modes of the same k, 
so that modes of different wavelength behave independently and have independent 
zero-point fluctuations. Finally, the principal result of the inflationary fluctuations 
in their late-time classical guise is as a perturbation to curvature, and it is not easy 
to see how to produce the separation in behaviour between photon and matter per- 
turbations which is needed for isocurvature modes. Towards the end of inflation, the 
universe contains nothing but scalar field and whatever mechanisms that generate 
the matter /antimatter asymmetry have yet to operate. When they do, the result 
will be a universal photon/baryon ratio but with a total density modulated by the 
residual inflationary fluctuations - adiabatic initial conditions, in short. 

Inflation thus makes a relatively firm prediction about the statistical character 
of the initial density perturbations, plus a somewhat less firm prediction for their 
power spectrum. With sufficient ingenuity, the space of predictions can be widened; 
isocurvature perturbations can be produced at the price of introducing additional 
inflation fields and carefully adjusting the coupling between them (Kofman & Linde 
1987), and breaking the Gaussian character of the fluctuations is also possible in 
such multi-field models (Yi & Vishniac 1993) - essentially because all modes in 
field 2 can respond coherently to a fluctuation in field 1, in much the same way 
as non-Gaussian perturbations are generated by cosmic strings. However, a nearly 
scale-free Gaussian adiabatic spectrum is an inevitable result in the simplest models 
with a single inflaton; if the theory is to have any predictive power and not to appear 
contrived, this is the clear prediction of inflation. As we shall see in the observational 
sections, the true state of affairs seems to be close to this state. 
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Inflaton coupling 

The calculation of density inhomogeneities sets an important limit on the inflation 
potential. From the slow-rolling equation, we know that the number of e-foldings of 
inflation is 



N = J Hdt = J Hd(p/<j) = J 3H 2 d<t>/V. 



(62) 



Suppose V{4>) takes the form V = \<f) 4 , so that N = H 2 /(2\(j) 2 ). The density 
perturbations can then be expressed as 

5 H ~ tL = „ \ 1/2 N 3/2 . (63) 
(f> v 

Since N > 60, the observed 5 H ~ 10 requires 



A < 10" 15 . 



(64) 



Alternatively, in the case of V = m 2 (f) 2 , 5 H = 3H 3 / (2m 2 </>). Since H ~ \/V /m P , this 
gives 5 H ~ mcj) 2 /m 3 ~ 10 ~ 5 . Since we have already seen that > m P is needed for 
inflation, this gives 



m < 10 5 m P . 



(65) 



These constraints appear to suggest a defect in inflation, in that we should be 
able to us the theory to explain why 5 H ~ 10 -5 , rather than using this observed fact 
to constrain the theory. The amplitude of 8 H is one of the most important numbers in 
cosmology, and it is vital to know if there is a simple explanation for its magnitude. 
Such an explanation does exist for theories based on topological defects, where we 
would have 

S H ~ [£ GUT /£ P ] 2 . (66) 

In fact, the situation in inflation is similar, since another way of expressing the 
horizon-scale amplitude is 

* H ~ ^7i- (67) 

We have argued that inflation will end with e of order unity; if the potential were 
to have the characteristic value V ~ -^gut th en this would give the same prediction 
for 8 H as in defect theories. The appearance of a tunable 'knob' in inflation theories 
really arises because we need to satisfy 4> ~ m P (for enough inflation) , while dealing 
with the characteristic value V ~ E^ VT (to be fair, this is likely to apply only at 
the start of inflation, but the potential does not change by a large factor 60 e- 
folds from the end of inflation unless the total number of e-folds is ^> 60). It is 
therefore reasonable to say that a much smaller horizon-scale amplitude would need 
V <^ -^gut? ^- e - a smaller E GUT than the conventional value. 
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This section has demonstrated the cul-de-sac in which inflationary models now 
find themselves: the field which drives inflation must be very weakly coupled - and 
effectively undetectable in the laboratory. Instead of Guth's original heroic vision 
of a theory motivated by particle physics, we have had to introduce a new entity 
into particle physics which exists only for cosmological purposes. In a sense, then, 
inflation is a failure. However, the hope of a consistent scheme eventually emerging 
(plus the lack of any alternative), means that inflationary models continue to be 
explored with great vigour. 

3.3 Gravity waves and tilt 

The density perturbations left behind as a residue of the quantum fluctuations in 
the inflaton field during inflation are an important relic of that epoch, but are not 
the only one. In principle, a further important test of the inflationary model is that 
it also predicts a background of gravitational waves, whose properties couple with 
those of the density fluctuations. 

It is easy to see in principle how such waves arise. In linear theory, any quantum 
field is expanded in a similar way into a sum of oscillators with the usual creation 
and annihilation operators; the above analysis of quantum fluctuations in a scalar 
field is thus readily adapted to show that analogous fluctuations will be generated 
in other fields during inflation. In fact, the linearized contribution of a gravity wave 
h^v to the Lagrangian looks like a scalar field = (m-p/Ay/ft) h^, so the expected 
rms gravity-wave amplitude is 

/i rms ~ H/m P . (68) 

The fluctuations in are transmuted into density fluctuations, but gravity waves 
will survive to the present day, albeit redshifted. 

This redshifting produces a break in the spectrum of waves. Prior to horizon 
entry, the gravity waves produce a scale-invariant spectrum of metric distortions, 
with amplitude h rms per Ink. These distortions are observable via the large-scale 
CMB anisotropies, where the tensor modes produce a spectrum with the same scale 
dependence as the Sachs- Wolfe gravitational redshift from scalar metric perturba- 
tions. In the scalar case, we have 5T/T ~ (p/3c 2 - i.e. of order the Newtonian metric 
perturbation; similarly, the tensor effect is 

~ hrms < $h ~ 10" 5 (69) 

GW 

(where the second step follows because the tensor modes can make no more than 
100% of the observed CMB anisotropy). The energy density of the waves is p GW ~ 
m 2 h 2 k 2 , where k ~ H(a en t ry ) is the proper wavenumber of the waves. At horizon 
entry, we therefore expect 

Pgw ~ ml h 2 ms H 2 (a entYy ). (70) 

After horizon entry, the waves redshift away like radiation, as a -4 , and generate 
a present-day energy spectrum per Ink which is constant for modes which en- 
tered the horizon while the universe was radiation dominated (because a oc t 1 ! 2 =>- 
H 2 a 4 = const). What is the density parameter of these waves? In natural units, 
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ft = (8n/3)p/(H 2 m 2 ,) : so ft GW ~ h 2 ms at the time of horizon entry - at which 
epoch the universe was radiation dominated with ft r = 1 to an excellent approx- 
imation. Thereafter, the wave density maintains a constant ratio to the radiation 
density, since both redshift as a -4 , giving the present-day density as 



ft GW ~ ft r [H/m P ] 2 ~ 10- 4 V/mi. 



(71) 



Therefore, just as with density perturbations in dark matter, the gravity-wave spec- 
trum displays a break between constant metric fluctuations on super-horizon scales 
to constant density fluctuations on small scales. If gravity waves make an important 
contribution to CMB anisotropies, h rms ~ 10 -5 and ft GW ~ 10 -14 is expected. 

A gravity-wave background of a similar flat spectrum is also predicted from 
cosmic strings (see section 10.4 of Vilenkin & Shellard 1994). Here, the prediction 
is 

ft GW ~ 100 [G7//c 2 ] ft r , (72) 

where fx is the mass per unit length. A viable string cosmology requires Gfi/c 2 ~ 
10 -5 , so ft GW ~ 10 -7 is expected - much higher than the inflationary prediction. 

The part of the spectrum with periods of order years would perturb the emis- 
sion from pulsars through fluctuating gravitational redshifts, and the absence of this 
modulation sets a bound of ft < 10~ 7 , so that V < 10~ 2 m 4 . However, this is still 
a very long way from the interesting inflationary level of ft GW < 10~ 14 . Although 
the strains implied by this level of relic gravity-wave background are tiny, it is not 
completely inconceivable that space-based versions of the same interferometer tech- 
nology being used to search for kHz-period gravity waves on Earth might eventually 
reach the required sensitivity. A direct detection of the gravity-wave background at 
the expected level would do much the same for the credibility of inflation as was 
achieved for the Big Bang itself by Penzias & Wilson in 1965. 

An alternative way of presenting the gravity- wave effect on the CMB anisotropies 
is via the ratio between the tensor effect of gravity waves and the normal scalar 
Sachs- Wolfe effect, as first analysed in a prescient paper by Starobinsky (1985). 
Express the fractional temperature variance as the contribution of a given spherical 
harmonic, Cg; for a scale-invariant spectrum, £ 2 Cg is a constant. The tensor and 
scalar contributions are respectively 

P CJ ~ h 2 ms ~ [H 2 /ml] ~ V/mi. (73) 

I 2 C? ~ 51 ~ H 2 /^ ~ H e /(V>) 2 ~ (74) 

The ratio of tensor to scalar variances of microwave background anisotropies is 
therefore proportional to the inflationary parameter e: 

§Ul2-4e, (75) 



inserting the exact coefficient from Starobinsky (1985). If it could be measured, the 
gravity- wave contribution to CMB anisotropies would therefore give a measure of one 
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of the dimensionless inflation parameters, e. The less de Sitter-like the inflationary 
behaviour, the larger the relative gravitational-wave contribution. 

Since deviations from exact exponential expansion also manifest themselves as 
density fluctuations which have spectra that deviate from scale invariance, this sug- 
gest a potential test of inflation. Define the tilt of the fluctuation spectrum as: 

din 5?, 

tllt = ( 1 -"» = -7i^- < 76 > 

We then want to express the tilt in terms of parameters of the inflationary potential, 
e and rj. These are of order unity when inflation terminates; e and r\ must therefore be 
evaluated when the observed universe left the horizon, recalling that we only observe 
the last 60-odd e-foldings of inflation. The way to introduce scale dependence is to 
write the condition for a mode of given comoving wavenumber to cross the de Sitter 
horizon 

a/k = H- 1 . (77) 

Since H is nearly constant during the inflationary evolution, we can replace d/d In k 
by d In a, and use the slow-roll condition to obtain 

d d 4> d nip V d . . 

dlnk~ a da~Hd0~ ~ 8^ V #' ^ ' 

We can now work out the tilt, since the horizon-scale amplitude is 

- Ht US " V3 (79) 



[2tt</>] 2 3 m%V' 2 ' 



and derivatives of V can be expressed in terms of the dimensionless parameters e 
and rj. The tilt of the density perturbation spectrum is thus predicted to be 



(1 - n) = 6e - 2rj 



(80) 



For most models in which the potential is a smooth polynomial-like function, 
\rj\ ~ |e|. Since e has the larger coefficient and is positive by definition, a general but 
not unavoidable prediction of inflation is that the spectrum of scalar perturbations 
should be slightly tilted in the sense that n is slightly less than unity. With a similar 
level of confidence, one can state that there is a coupling between this tilt and the 
level of the gravity-wave contribution to CMB anisotropies: 

%*6(l-n) (81) 

In principle, this is a distinctive prediction of inflation, but it is a test which loses 
power the more closely the fluctuations approach scale invariance. 

It is interesting to put flesh on the bones of this general expression and evaluate 
the tilt for some specific inflationary models. This is easy in the case of power-law 
inflation with a oc t p because the inflation parameters are constant: e = rj/2 = 1/p, 
so that the tilt here is always 



Inflationary Cosmology and Structure Formation 27 



(l-n) = 2/p (82) 

In general, however, the inflation derivatives have to be evaluated explicitly on the 
largest scales, 60 e-foldings prior to the end of inflation, so that we need to solve 

r 8tt f 4 ' V 

60 = Hdt=^r — d<f>. (83) 

J m P ^end V 

A better motivated choice than power-law inflation would be a power-law potential 
V(4>) oc (p 01 ; many chaotic inflation models concentrate on a = 2 (mass-like term) or 
a = 4 (highest renormalizable power). Here, e = rn^ot 2 / (167T(/> 2 ), rj = ex 2(a — l)/a, 
and 




It is easy to see that </> en d <C 4> an d that e = a/240, leading finally to 

(1 -n) = (2 + a)/120. (85) 

The predictions of simple chaotic inflation are thus very close to scale invariance 
in practice: n = 0.97 for a = 2 and n = 0.95 for a = 4. However, such a tilt has 
a significant effect over the several decades in k from CMB anisotropy measure- 
ments to small-scale galaxy clustering. These results are in some sense the default 
inflationary predictions: exact scale invariance would be surprising, as would large 
amounts of tilt. Either observation would indicate that the potential must have a 
more complicated structure (or that the inflationary framework is not correct). 



4 Gravitational dynamics of fluctuations 

4.1 Gravitational perturbation theory 

We now summarize briefly some of the basics relating to the growth of density 
perturbations in the post-inflationary phase. The study of perturbations in general 
relativity can be a rather complicated and messy subject. Fortunately, most of the es- 
sential physics can be extracted from a Newtonian approach (not so surprising when 
one remembers that small perturbations mean weak gravitational fields). We start 
by writing down the fundamental equations governing fluid motion (non-relativistic 
for now): 

Dw Vp „^ 

Euler : = - W 

Dt p 

F D P X7 ( 86 ) 

Energy : —— = — pv ■ v 

6J Dt ' 
Gauss : V 2 ^ = AnGp, 

where D/Dt = d/dt + v • V is the usual co moving derivative. We now linearize 
these by collecting terms of first order in perturbations about a homogeneous back- 
ground: p = po + dp etc. The result looks simpler if we define the fractional density 
perturbation 
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Sp 
Po' 



(87) 



Also, when dealing with time derivatives of perturbed quantities, the full comoving 
time derivative D/Dt can be replaced by d/dt = d/dt + Vo • V - which is the time 
derivative for an observer comoving with the unperturbed expansion of the Universe. 
We then can write 



±Sv = 
dt 



VSp 



V 6<P - (Sv • V)v 



dt 6 = - V 



(88) 



Po 
Sv 

V 2 5$ = 4nGp S. 

The next step is to translate spatial derivatives into comoving coordinates: 

x(t) = a(t)r(t) V -> -V, (89) 

ct 

and to make a similar transformation for peculiar velocity: 

Sv = au (90) 

(although note that u is still a function of time, unlike r) . 

When the equations are recast in these variables, there is only one complicated 
term to be dealt with: (Sv ■ V)v on the rhs of the perturbed Euler equation. This 
is best attacked by writing it in components: 



[{SvV)v } j = [Sv] i V i [v } j = H[Si 



(91) 



where the last step follows because vo = -ffxo =>- Vi[^o]j = HSij. The equations 
for conservation of momentum and matter then take the following simple forms in 
comoving units: 



u + 2-u 

a a 

8 = -V • u. 



g VSp 



Po 



(92) 



The peculiar gravitational acceleration is denoted by g. Note that, in the absence 
of peculiar accelerations and pressure forces, velocities redshift away through the 
'Hubble drag' term 2Hu. This behaviour is reasonable: if we shoot a bullet away 
from us with a proper peculiar velocity v, then after time t it is vt away, and its 
near neighbours have a recessional velocity H vt. The proper velocity thus decays as 
v + Hv = or ii + 2Hu = 0, because the bullet is always having to overtake distant 
galaxies with progressively higher speeds. 

After doing all this, we still have three equations in four variables (5, u, 5^, Sp). 
The system needs an equation of state to be closed, which may be specified in terms 
of the sound speed 



dp 
dp' 



(93) 
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If we now think of a plane-wave disturbance 5 oc e lk r (so that k is a comoving wave- 
vector) , then an equation for 5 can be obtained by eliminating u (take the divergence 
of the perturbed Euler equation and the time derivative of the continuity equation 
and eliminate V ■ u) : 



5 + 2-5 = 5[AnG Po - c 2 s k 2 /a 2 ]. 

Ct 



(94) 



This equation is the one which governs gravitational amplification of density 
perturbations. If we had linearized about a stationary background and taken k to 
be a proper wave-vector, then the equation would have been much easier to derive, 
and the result would be just 5 = 5(4%Gpo — c 2 k 2 ), which has the solutions 



5{t) 



l/^AnGpo-clh 2 . 



(95) 



There is a critical proper wavelength, known as the Jeans' Length, at which we 
switch from the possibility of exponential growth for long- wavelength modes to 
standing sound waves at short wavelengths. This critical length is 



(96) 




and clearly delineates the scale at which sound waves can cross an object in about 
the time needed for gravitational free-fall collapse. When considering perturbations 
in an expanding background, things are more complex. Qualitatively, we expect to 
have no growth when the 'driving term' on the rhs is negative. However, owing to 
the expansion, Aj will change with time, and so a given perturbation may switch 
between periods of growth and stasis. These effects help to govern the form of the 
perturbation spectrum which is propagated to the present Universe from early times, 
and will be considered in detail shortly. 



Radiation-dominated universes 

At early enough times, the Universe was radiation dominated (c s = c/\/3) and 
the analysis so far does not apply. It is conventional to resort to general relativity 
perturbation theory at this point. However, the fields are still weak, and so it is 
possible to generate the results we need by using special relativity fluid mechanics 
and linearized Einstein gravity. For simplicity, assume that pressure gradients are 
negligible (i.e. restrict ourselves to A 3> A,, from the start). The basic equations are 
then 

Dv 

Euler : = — 

Dt 



Energy: ^(p + p/c 2 ) = ^p/c 2 - {p + p/c 2 )V • v (9 ° 
Gauss : V 2 = 4nG(p + 3p/c 2 ). 
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For total radiation domination, p = pc 2 /3 and we can linearize as before, to obtain 

n a ■ 32/T 

5 + 2-5 = —-Gp 5, 98 
a 3 



so the net result of all the relativistic corrections is a driving term which is a factor 
8/3 higher. 

Solutions for S(t) 

In both matter- and radiation-dominated universes with Q = 1, we have po oc 1/t 2 : 

2 

Matter domination (a oc t 2 ^ 3 ) : AnGpo = — ~ 

3t (99) 
Radiation domination (a oc t 1 ^ 2 ) : 32nGpo/3 = 

Every term in the equation for 5 is thus the product of derivatives of 5 and powers 
of t, and a power-law solution is obviously possible. If we try 5 oc t n , then the result 
is n = 2/3 or —1 for matter domination; n = ±1 for radiation domination. For 
the growing mode, these can be combined rather conveniently using the conformal 
time rj = J dt/a: 



5 oc 



(100) 



Recall that rj is proportional to the comoving size of the horizon. 

One further way of stating this result is that gravitational potential perturbations 
are independent of time (at least while Q = 1). Poisson's equation tells us that 
—k 2 $/a 2 oc p5; since p oc a~ 3 for matter domination or a~ 4 for radiation, that 
gives $ oc 5/ a or 5 /a 2 respectively - independent of a in either case. In other 
words, the metric fluctuations resulting from potential perturbations are frozen, at 
least for perturbations which are outside the horizon. This conclusion demonstrates 
the self-consistency of the basic set of fluid equations which were linearized. The 
equations for momentum and energy conservation are always valid, but the correct 
relativistic description of linear gravity should be a wave equation, including time 
derivatives of $ as well as spatial derivatives. Since we have used only Newtonian 
gravity, this implies that any solutions of the perturbation equations in which 
varies will not be valid on super- horizon scales. This criticism does not apply to the 
growing mode, where ^ is constant, but it does apply to decaying modes (Press & 
Vishniac 1980). This difficulty concerning perturbations on scales greater than the 
horizon is related to gauge freedom in general relativity and the fact that the value 
of density perturbations 5 can be altered by a coordinate transformation. Exchange 
of light signals can be used to establish a 'sensible' coordinate system, but only on 
sub- horizon scales. Otherwise, the results obtained are gauge dependent. We are 
implicitly using here what might be called the Newtonian gauge, where the the 
metric is expressed as the FRW form with perturbation factors (1 ± 2^/c 2 ) in the 
time and spatial parts respectively. 
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In models with Q < 1, the growth is slowed. It is possible to write explicit 
expressions for 5(a), but it is more convenient in practice to use the following accu- 
rate approximation, due to Carroll et al. (1992), which also allows for the effects of 
vacuum energy: 



5(z = 0, Q) 
5(z = 0, Q = 1) 



5 

2"w 



-Q v + (l + \Q m )(l + ^Q v ) 



(101) 



For models without vacuum energy, the growth is reduced by a factor of approx- 
imately J? - 65 ; for flat models with Q m + Q v = 1, the growth suppression is less 
marked - approximately J? 23 . 



Meszaros effect 

What about the case of collisionless matter in a radiation background? The fluid 
treatment is not appropriate here, since the two species of particles can interpen- 
etrate. A particularly interesting limit is for perturbations well inside the horizon: 
the radiation can then be treated as a smooth, unclustered background which affects 
only the overall expansion rate. The perturbation equation is as before 

5 + 2-5 = 4nGp m 5, (102) 

CL 

but now H 2 = 87rG(p m + p r )/3. If we change variable to y = p m / p r = a/a eq , then 
the equation becomes 

6» + -l±^y 6 ' - 5 (103) 
2y(l + y) 2y{l + y) 1 > 

(for k = 0, as appropriate for early times). It may be seen by inspection that a 
growing solution exists with 5" = 0: 



5 ocy + 2/3. 



(104) 



It is also possible to derive the decaying mode. This is simple in the radiation- 
dominated case (y <C 1): 5 oc — Iny is easily seen to be an approximate solution in 
this limit. 

What this says is that, at early times, the dominant energy of radiation drives 
the universe to expand so fast that the matter has no time to respond, and 5 is frozen 
at a constant. At late times, the radiation becomes negligible, and the growth picks 
up smoothly to the Einstein-de Sitter 5 oc a behaviour. The overall behaviour is 
therefore similar to the effects of pressure on a coupled fluid. For scales greater than 
the horizon, perturbations in matter and radiation can grow together; this growth 
ceases once the perturbations enter the horizon. However, the explanations are com- 
pletely different. In the fluid case, the radiation pressure prevents the perturbations 
from collapsing further; in the collisionless case, the photons have free-streamed 
away, and the matter perturbation fails to collapse only because radiation domi- 
nation ensures that the universe expands too quickly for the matter to have time 
to self-gravitate. Because matter perturbations enter the horizon with 5 > 0, 5 is 
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not frozen quite at the horizon-entry value, and continues to grow until this initial 
'velocity' is redshifted away, giving a total boost factor of roughly lny entry . This log 
factor may be seen below in the fitting formulae for the CDM power spectrum. 



Solutions for u 

To discuss velocities, go back to the basic equation for u: 



. , 2d 
u H u 

a 



g. 

a 



(105) 



g = —V5<3>/a is the peculiar gravitational acceleration, and pressure terms are ne- 
glected, so A > Aj. The peculiar velocity can be decomposed into modes either 
parallel or perpendicular to g. The latter are vorticity modes which decay. For 
the former, we know from the continuity equation (V • u = —5) that u = (5/5)u. 
Hence, the solution of the above equation for u has u oc g, and may be expressed as 



*v = ^g 

mo g ' 



(106) 



where the function f(f2) = (a/5)d5/da. A very good approximation to this (Peebles 
1980) is / ~ J? 6 . Alternatively, we can work in Fourier terms. This is easy, as g 
and k are parallel, so that V ■ u = iku. Thus, directly from the continuity equation, 




(107) 



The 1/k factor tells us that cosmological velocities come predominantly from large- 
scale perturbations. Deviations from the Hubble flow are therefore in principle a 
better probe of the inhomogeneity of the universe than large-scale clustering. 



4.2 Transfer functions 

There are in essence two ways in which the power spectrum which exists at early 
times may differ from that which emerges at the present, both of which correspond 
to a reduction of small-scale fluctuations: 

(i) Jeans' mass effects. Prior to matter-radiation equality, we have already seen 
that perturbations inside the horizon are prevented from growing by radiation pres- 
sure. This leads to an effective 'break' of An = 4 in the power spectrum at this 
point: 

4ocA"^+ 3 )/ 2 A>A 7 

(108) 

Once z eq is reached, one of two things can happen. If collisionless dark matter 
dominates, perturbations on all scales can grow. If baryonic gas dominates, the 
Jeans length remains approximately constant, as follows: The sound speed, c 2 s = 
dp/ dp, may be found by thinking about the response of matter and radiation to 
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small adiabatic compressions: 5p = (4/9)p r c 2 (5V/V), 5p = [p m + (4/3)p r ](5V/V), 
implying 

^- 2 ( 3+ ^) _1 = c2 ( 3+ ^) _1 - < 109 » 

Here, z ra d is the redshift of equality between matter and photons (l+z ra d = 1.68(1 + 
z eq ) because of the neutrino contribution). At z <C z ra d, we therefore have c s oc 
\Jl + z. Since p = (1 + z) 3 3f2 B HQ / (8t:G) , the comoving Jeans' length is constant at 

A '-wXw^^T^ Hsi - h2) ~ lu ^ (no) 

Thus, in either case, one of the critical length scales for the power spectrum will 
be the horizon distance at z eq (= 2b000f2h 2 for T = 2.7 K, counting neutrinos as 
radiation). In the matter-dominated approximation, we get 

d u = ^L(nz)- 1 ' 2 = 29 (Qh^Mpc. (Ill) 

The exact answer including radiation is a factor \/2 — 1 times this: 15.7 (i7/z 2 ) — 1 
Mpc. 

(ii) Damping. In addition to having their growth retarded, very small perturba- 
tion will be erased entirely, which can happen in one of two ways. For collisionless 
dark matter, perturbations are erased simply by free streaming: random particle 
velocities cause blobs to disperse. At early times (kT > mc 2 ), the particles will travel 
at c, and so any perturbation which has entered the horizon will be damped. This 
process switches off when the particles become non-relativistic; for massive particles, 
this happens long before z eq (Cold Dark Matter). For massive neutrinos, on the 
other hand, it happens at z eq : only perturbations on very large scales survive in the 
case of hot dark matter. In a pure baryon universe, the corresponding process 
is called Silk damping: the mean free path of photons due to scattering by the 
plasma is non-zero, and so radiation can diffuse out of a perturbation, convecting 
the plasma with it. The typical distance of a random walk in terms of the diffusion 
coefficient, D, is x ~ VDt, which gives a damping length of 



A s ~ (H2) 

- the geometric mean of the horizon size and the mean free path. Since A = 
l/(na T ) = 44.3(1 + z)~ 3 (f2 B h 2 )~ 1 proper Gpc, we obtain a comoving damping 
length of 

A s = 16.3 (1 + z)- 5 / 4 (f2lQh 6 )- 1/4 Gpc. (113) 

This becomes close to the Jeans' length by the time of last scattering, 1 + z ~ 1000. 

Real power spectra thus result from modifications of any primordial power by a 
variety of processes: growth under self-gravitation, effects of pressure and dissipative 
processes. In general, modes of short wavelength have their amplitudes reduced 
relative to those of long wavelength in this way. The overall effect is encapsulated in 
the transfer function, which gives the ratio of the late-time amplitude of a mode 
to its initial value. The detailed result can be hard to calculate, mainly because we 
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have a mixture of matter (both collisionless dark particles and baryonic plasma) and 
relativistic particles (collisionless neutrinos and collisional photons) which does not 
behave as a simple fluid. Particular problems are caused by the change in the photon 
component from being a fluid tightly coupled to the baryons by Thomson scattering, 
to being collisionless after recombination. Accurate results require a solution of 
the Boltzmann equation to follow the evolution in detail. The transfer function 
is thus a by-product of elaborate numerical calculations of microwave background 
fluctuations. Nevertheless, once we possess the transfer function, it is a most valuable 
tool. The evolution of linear perturbations back to last scattering obeys the simple 
relations summarised above, and it is easy to see how structure in the Universe will 
have changed during the matter-dominated epoch. 

It is thus invaluable in practice to have some accurate analytic formulae which fit 
the numerical results for transfer functions. We give below results for some common 
models in the form of the transfer function needed to produce a scale-invariant 
power spectrum at large wavelength, A 2 oc k 4 T^. For adiabatic models, T k is the true 
transfer function; for isocurvature models, this is not the case and T k is proportional 
to 1/k 2 times the true transfer function. We assume Q B <C J?, so that all lengths 
scale with the horizon size at matter-radiation equality, leading to the definition 
q = fc/(i?/i 2 Mpc _1 ). We consider the cases of (A) Adiabatic CDM; (B) Adiabatic 
massive neutrinos (1 massive, 2 massless); (C) Isocurvature CDM; these expressions 
come from Bardeen et al. (1986; BBKS). 



These models contain some hidden variables. Since the characteristic length-scale 
in the transfer function depends on the horizon size at matter-radiation equality, 
the temperature of the CMB enters. In the above formulae, it is assumed to be 
exactly 2.7 K; for other values, the characteristic wavenumbers scale oc T~ 2 . For 
these purposes massless neutrinos count as radiation, and three species of these 
contribute a total density which is 0.68 that of the photons. 

There is also the question of the baryon contribution: the above expressions 
assume pure dark matter, which is unrealistic. At least for CDM models, a non-zero 
baryonic density lowers the apparent dark-matter density parameter. We can define 
an apparent shape parameter for the transfer function, r*, where 



and r* = Qh in a model with zero baryon content. Peacock & Dodds (1994) showed 
that the effect of increasing f2 B was to preserve the CDM-style spectrum shape, but 
to shift to lower values of r*. This shift was generalized to models with Q ^ 1 by 
Sugiyama (1995): 



(A) T k 



(B) T k 

(C) T k 




(114) 



q = (/c//iMpc _1 )/r*, 



(115) 



r* = Qh exp[-^ B (l + 1/(2)]. 



(116) 
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4.3 N-body models 

This is a good place to discuss how to use the above equations of motion to carry 
out a nonlinear evolution of the density field. This is usually done by means of 
the N-body simulation, in which the density field is represented by the sum 
of a set of fictitious discrete particles. The equations of motion for each particle 
depend on solving for the gravitational field due to all the other particles, finding 
the change in particle positions and velocities over some small time step, moving 
and accelerating the particles, and finally re-calculating the gravitational field to 
start a new iteration. Using comoving units for length and velocity (v = cm), we 
have from above the equation of motion 

^u=-2^u-iv^ (117) 

at a a 2 

where $ is the Newtonian potential. The time derivative is already in the form of 
a total (or convective) derivative, as required for particle motions, rather than the 
partial d/dt. If we change time variable from t to a, this becomes 

d r 9 n a G sr~^ Xj — x , 11C > 

(118) 



cUna r UJ " H & ~ aH l^-x^ 

Here, the gravitational acceleration has been written exactly by summing over all 
particles, but this becomes prohibitive for very large numbers of particles. Since the 
problem is to solve Poisson's equation, a faster approach is to use Fourier methods, 
since this allows the use of the FFT algorithm. If the density perturbation field 
(not assumed small) is expressed as 5 = ^5fcexp[— zk • x], then Poisson's equation 
becomes —k 2 $k = 4:7tGa 2 p5k, and the required fc-space components of are just 

[V<% = -i$ fc k. (119) 

If we finally eliminate density in terms of S7, the equation of motion for a given 
particle is 

tt — [a 2 u] = Y] F fc ex P H k • x ] ; 

dlna ^ (12Q) 



Boxes and grids 

The efficient way of performing the required Fourier transforms is by averaging the 
data onto a grid and using the FFT algorithm, both to perform the transformation 
of density, and to perform the (three) inverse transforms to obtain the real-space 
force components from their /c-space counterparts. This leads to the simplest iV- 
body algorithm: the particle-mesh (PM) code. The only complicated part of the 
algorithm is the procedure for assigning mass to gridpoints, and interpolating the 
force as evaluated on the grid back onto the particles (for consistency, the same 
procedure must be used for both these steps). The most naive method is simply to 
bin the data: i.e. associate a given particle with whatever gridpoint happens to be 
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nearest. There are a variety of more subtle approaches (see Hockney & Eastwood 
1988; Efstathiou et al. 1985), but whichever strategy is used, the resolution of a 
PM code is clearly limited to about the size of the mesh. To do better, one can 
use a particle-particle-particle-mesh (P 3 M) code, also discussed by the above 
authors. Here, the direct forces are evaluated between particles in the neighbouring 
cells, with the grid estimate being used only for particles in more distant cells. A 
similar effect, although without the use of the FFT, is achieved by tree codes (e.g. 
Hernquist Bouchet & Suto 1991). 

In practice, however, the increase in resolution from these methods is limited to 
a factor of a few. This is because each particle in a cosmological A^-body simulation 
in fact stands for a large number of less massive particles. Close encounters of these 
spuriously large particles can lead, through three-body processes, to the formation of 
unphysical close massive binaries. To prevent this, the forces must be softened: set 
to a constant below some critical separation, rather than rising as 1/r 2 . If there are 
already few particles per PM cell, the softening must be some significant fraction of 
the cell size, so there is a limit to the gain over pure PM. For example, consider a box 
of side 50/i _1 Mpc, which is the smallest that can be used to simulate the observed 
universe without serious loss of power from the omitted long- wavelength modes. A 
typical size of calculation might use 128 3 particles on a Fourier mesh of the same 
size, so that the mean density is one particle per cell, and the cell size is of order that 
of the core of a rich cluster. To use such a simulation to study cluster cores means 
we are interested in overdensities of 10 3 - 10 4 , or typical interparticle separations 
of 0.05 - 0.1 of a cell. To avoid collisional effects, the pairwise interaction must be 
softened on this scale. A much larger improvement in resolution is only justified in 
regions of huge overdensity (~ 10 6 for a 100- fold increase in resolution over PM). 
The overall message is that A^-body simulations in cosmology are severely limited 
by mass resolution, and that this limits the spatial resolution that can be achieved 
while still modelling the evolution of the true collisionless fluid. 



From a practical point of view, it is convenient to change to a new set of units 
which incorporate the size of the computational box, and allow the simulation to be 
rescaled to different physical situations. Let the side of the box be L; it is clearly 
convenient measure length in terms of L and velocities in terms of the expansion 
velocity across the box: 



Since, for A" particles the density is p = Nm/ (aL) s , the mass of the particles and 
the gravitational constant can be eliminated and the equation of motion can be cast 
in an attractively dimensionless form: 



Units 



X = x/L 

U = 5v/(HLa) = u/HL. 



(121) 




(122) 
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The function f(a) is proportional to a 2 H(a), and has an arbitrary normalization - 
e.g. unity at the initial epoch. If the forces are instead evaluated on a grid, dimen- 
sionless wavenumbers K = k L are used, and the corresponding equation becomes 

^-[f(a)V] = £F*exp[-tK • X]; 

Fk = ~ % ^\% Q{a)f{a) 
Particles are now moved according to dx. = u dt, which becomes 

dX = Vd\na (124) 

in our new units. It only remains to set up the initial conditions; this is easy to do 
if the initial epoch is at high enough redshift that Q = 1, since then U oc a and the 
initial displacements and velocities are related by 

AX = U. (125) 

In the case where the initial conditions are specified late enough that Q is signifi- 
cantly different from unity, this can be modified by using the linear relation between 
density and velocity perturbations: for a given A, the corresponding velocity scales 
as J? 0,6 (see above). It is not in fact critical that the density fluctuations be very 
small at this time: this is related to a remarkable approximation for nonlinear dy- 
namics due to Zeldovich. 



(123) 



4.4 Hierarchical density fields 

We have seen that the perturbations which survive recombination are of two dis- 
tinct classes: in some (such as cold dark matter), primordial fluctuations survive on 
very small scales (small-scale damping); in other cases, such as hot dark matter 
or adiabatic baryons, the perturbation field is dominated by fluctuations on scales 
~ the horizon at z eq (large-scale damping). The consequences for galaxy forma- 
tion are radically different. In the former case, non-linear collapse of sub-galactic 
mass units can be the first event which occurs after recombination. These will then 
cluster together in a hierarchy, forming successively more massive systems as time 
progresses. Hierarchies are also known as bottom-up pictures for galaxy forma- 
tion. Conversely, in large-scale damping, pancakes of cluster or supercluster size are 
the first structures to form. Galaxies must be presumed to form through dissipative 
processes occurring in the shocked gas which forms at pancake collapse (top down). 

As described, it is clear that top-down pictures are unappealing in that the 
physics of galaxy formation is likely to be very complex and messy. Reality is often 
like this: the formation of stars is a good example of a fundamental process where 
is is hard to understand what is going on. In contrast, the computational simplic- 
ity of hierarchies has led to much more detailed work being performed on them. 
Theoretical prejudice aside, however, the Universe looks like a hierarchy, displaying 
many small groups of galaxies (e.g. the Milky Way's own Local Group), which exist 
within superclusters which are only mildly nonlinear. 

It may seem that such a situation cannot be analysed within the bounds of lin- 
ear theory, but a way forward was identified by Press & Schechter (1974; PS). The 
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critical assumption in the PS analysis is that, even if the field is non-linear, the 
amplitude of large- wavelength modes in the final field will be close to that predicted 
from linear theory. For this to be true requires the 'true' large-scale power to exceed 
that generated via non-linear coupling of small-scale modes, which turns out to re- 
quire a spectral index n < 1 (Williams et al. 1991a). We now proceed by recognising 
that, for a massive clump to undergo gravitational collapse, the average overdensity 
in a volume containing that mass should (as usual) exceed some threshold, 5 C , of 
order unity. The location and properties of these bound objects can thus be esti- 
mated by an artificial smoothing (or filtering) of the initial linear density field. If 
the filter function has some characteristic length Rf, then the typical size of filtered 
fluctuations will be ~ Rf and they can be assigned a mass M ~ Po-R/- The exact 
analytic form of the filter function is arbitrary and is often taken to be a Gaussian 
for analytic convenience. 

The argument now proceeds in integral terms. For a given Rf, the probability 
that a given point lies in a region with 5 > 5 C (the critical overdensity for collapse) 
is 



where cr(Rf) is the linear rms in the filtered version of 5. The PS argument now 
takes this to be proportional to the probability that a given point has ever been 
processed through a collapsed object of scale > Rf - This is really assuming that the 
only objects which exist at a given epoch are those which have just collapsed: if a 
point has 5 > 5 C for a given Rf, then it will have 5 = 5 C when filtered on some larger 
scale and will be counted as an object of the larger scale. The problem with this 
argument is that half the mass remains unaccounted for: this was amended by PS 
simply by multiplying the probability by a factor 2. Note that this procedure need 
not be confined to Gaussian fields; all we need is the functional form of p(8 > 5 C \Rf). 
The analogue of the factor of 2 problem remains: what to do with the points having 
5 < 0. 

This integral probability is related to the mass function f(M) (defined such that 
f(M)dM is the comoving number density of objects in the range dM) via 



where po is the total comoving density. Thus, 



M 2 f(M) 2S C 


din a 


exp(-i5 c 2 /a 2 ). 


Po a 


dlnM 



(128) 



We have expressed the result in terms of the multiplicity function: M 2 f(M)/p 
is the fraction of the mass which is carried by objects in a unit range of InM. For 
power-law spectra, this function takes a very simple form: 




(126) 



Mf(M)/p = \dp/dM\, 



(127) 




f(M) n + 3 




(129) 
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Where v is the threshold in units of the rms density fluctuation. The multiplicity 
function thus always has the same shape (a skew-negative hump around v ~ 1); 
changing the spectral index only alters the mass scale via v = (M/M c )( n+S ^ 6 . 



Random walks and conditional mass functions 

The factor of 2 'fudge' has long been recognized as the crucial weakness of the PS 
analysis. What one has in mind is that the mass from lower-density regions accretes 
onto collapsed objects, but it does not seem correct for this to cause a doubling 
of the total number of objects. Recent work has shed some light on the origin of 
this problem (Peacock & Heavens 1990; Bond et al. 1991). To see where the error 
crept in, consider the random trajectory taken by the filtered field at some fixed 
point as a function of filtering radius. This starts at 5 = at R = oo, and develops 
fluctuations of increasing amplitude as we move to smaller R. Thus, if 5 < 5 C at a 
given point, it is quite possible that it will exceed the threshold at some other point 
- indeed, if the field variance diverges as R — > 0, it is inevitable that the threshold 
will be exceeded. So, instead of ignoring all points below threshold at a given R, we 
should find the first upcrossing of the random trajectory: the largest value of R 
for which 6 = 5 C . 

This analysis is most easily performed for one particular choice of filter: sharp 
truncation in /c-space. Decreasing R then corresponds to adding in new /c-space 
shells, all of which are independent for a Gaussian field. The trajectory is then just 
a random walk, and the solution is very easy. Consider a point on the walk which has 
reached the threshold; its subsequent motion will be symmetric, so that it is equally 
likely to be found above the threshold as below at some smaller R. The probability 
of never having crossed the threshold (the survival probability) is then obtained 
by reflection of the Gaussian above threshold: 



dP, 1 



d5 ^/2ti a 



exp — — j — exp 



2a 2 ; ^ V 2a 



(130) 



Integrating this up to get the probability of having crossed the threshold at least 
once gives 



1 - p ' = 1 - ei( {vt)' (131) 

which is just twice the unconstrained probability of lying above the threshold - thus 
supplying the missing factor of 2. Unfortunately, the above analysis is only valid for 
this special choice of filter: using /c-space filters which are differentiable leads to just 
the original PS form (without the factor 2) at high mass, with the surplus probability 
being shifted to low masses, so that the shape of the function changes. Which filter 
is the best choice, we cannot say in advance; we are left with the empirical fact that 
the PS formula does fit iV-body results quite well. 

One useful extension of the random- walk model is that it allows a calculation of 
the conditional multiplicity function: given a particle in a system of mass M 
at some epoch ao, what was the distribution of masses where that particle resided 
at some earlier epoch ai? This is now just the random walk with two absorbing 
barriers, at 5 c /cio and 5 c /cii; we want the probability of not having crossed the 
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second, subject to having crossed the first at M . The solution for the integral mass 
distribution is 



P(> fj) = 1 - erf 



Ht- 1 - 1) 



(132) 



where v = 5 c /aocro; * = ai/a>o', A* = o" 2 (-^o)/o" 2 (-^)- This function tells us that 
the early histories of particles which end up in different mass systems are markedly 
different over a large range of expansion factor. This may provide a way of un- 
derstanding many of the systematics of galaxy systems in terms of their merger 
histories: see Bower (1991); Lacey & Cole (1993). 

The way to deduce a merger rate from the above is to consider the limit of the 
conditional mass function at two nearly equal epochs: the time derivative of the 
conditional function must be related to the merger rate. First, we need to invert 
the above reasoning, which gives P(Mi,z \ M ,z = 0). Conditional probability 
definitions then imply 



P(M | Mi) = 



P(M 1 | M ) P(M ) 
P{M 1 ) 



(133) 



We now have the probability that an object of mass Mi at redshift z becomes 
incorporated into one of mass Mq at redshift z = 0. The merger rate A4, at which 
an object of mass Mq accretes objects of mass AM, is then 



M{M ,AM) = 



d 2 P(M | Mi) 



dM dt 



(134) 



tl— »to 



These analytical descriptions of how mass gathers into clumps in hierarchical 
clustering have been shown to describe the results of numerical simulations ex- 
tremely well (Bond et al. 1991) and provide a powerful tool for understanding the 
growth of cosmological structure. For example, it is now possible to answer the 
apparent paradox of the simple Press- Schechter theory, where all collapsed objects 
considered are those which formed only at the instant considered {i.e. sit exactly at 
the threshold 8 = 8 C ), even though common sense says that some objects must have 
formed at high redshift and survived unchanged to the present day. The conditional 
mass function indeed allows us to define something close to the typical formation 
epoch for a clump, which we might take as the time when the probability is 0.5 that 
the precursor mass is at least half the final mass. The conditional mass function says 
that this time depends on the final mass of the clump; if it is 3> M* , the formation 
epoch will be in the recent past, whereas low-mass clumps survive for longer. As a 
specific example, this calculation implies that massive clusters form extremely late 
in an Einstein-de Sitter model. About 30% of Abell clusters will have doubled their 
mass since as recently as z = 0.2 (as against only 5% if Q = 0.2). The observed 
frequency of substructure in clusters is used on this basis by Lacey & Cole to argue 
that Q > 0.5 (although lower Q is allowed in models with vacuum energy, since 
what matters for this analysis is the linear growth suppression factor). 
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Application to galaxy clusters 

The most important practical application of the PS formalism is to rich clusters. 
As already discussed, these are the most massive non-linear systems in the current 
Universe, so a study of their properties should set constraints on the shape and 
normalization of the power spectrum on large scales. These issues are discussed 
by e.g. Henry & Arnaud (1991), who sidestep the issue of what mass to assign 
to a cluster by using the observed distribution of temperatures (see below for the 
relation between mass and virial temperature). Fitting their data with the PS form 
for top-hat filtering and S c — 1.69 they deduce a linear-theory rms in 8/i _1 Mpc 
spheres of a 8 = 0.59 ± 0.02 for Q = 1. Similarly, White, Efstathiou & Frenk (1993) 
deduce a 8 = 0.57 ± 0.06, although they rely on the central masses of Abell clusters 
being known when calibrating this number. These calculations have been checked 
against numerical simulations, so the result therefore seems rather robust and well- 
determined. For models with low density, the required normalization rises: for a lower 
mean density, more collapsed systems are needed to yield the observed number of 
objects of a given mass. The approximate scaling with Q is approximately the same 
as is required in the cosmic virial theorem to keep velocity dispersions constant 
(reasonably enough, given that these determine the cluster masses): 



This knowledge of a% as a measure of the true mass inhomogeneity is a fundamental 
cosmological datum. 

4.5 Galaxy formation 
Cooling and galaxy formation 

This discussion of mass functions really applies only to 'haloes' of collisionless dark 
matter. Things are more complex for baryonic matter, where we must ask if the 
matter has been able to dissipate and turn into stars. This question was analyzed 
in a classic paper by Rees & Ostriker (1977), and has been reconsidered in the 
context of CDM by Blumenthal et al. (1984). 

Mergers will heat gas up to the virial temperature via shocks; in order for the 
gas to form stars, it must be able to undergo radiative cooling - to dispose of this 
thermal energy. Clearly, the redshift of collapse clearly needs to be sufficiently large 
that there is time for an object to cool between its formation at redshift z coo \ (when 
Sp/p ~ S c ) and the present epoch. We shall show below that z coo \ is a function 
of mass; it is therefore possible to put cooling into the Press-Schechter machinery 
simply by using the mass-dependent threshold u(M) = S c [l + z CO oi(M )]/o"o(M) in 
the mass function. 

The cooling function for a plasma in thermal equilibrium has been calculated by 
Raymond, Cox & Smith (1976). For an H + He plasma with Y = 0.25 and some 
admixture of metals, their results for the cooling time (t coo i = 3kT/2A(T)n) may 
be approximated as roughly 



<7 8 ~ (0.5-0.6) Q 



-0.5 



(135) 




-l 



T 8 1/2 + 0.5/ m T 8 3/2 ) \ (136) 



^cool/years = 1.8 x 10 



,24 



PB 



M Mpc 



-3 
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where T 8 = T/10 8 K. The T -1 / 2 term represents bremsstrahlung cooling and the 
T -3 / 2 term approximates the effects of recombination radiation. The parameter 
f m governs the metal content: f m = 1 for solar abundances; f m ~ 0.03 for no 
metals. In this model where so far dissipation has not been considered, the baryon 
density is proportional to the total density, the collapse of both resulting from purely 
gravitational processes. p B is then a fraction Q^jQ of the virialized total density. 
This is itself some multiple f c of the background density at virialization (which we 
refer to as 'collapse'): 



Pc = fcPo (1 + z c y 



(137) 



The virialized potential energy for constant density is 3GM 2 / (5r), where the radius 
satisfies 4irp c r 3 /3 = M. This energy must equal 3MkT/ (/xm p ), where fx = 0.59 for a 
plasma with 75% Hydrogen by mass. Hence, using p 
we obtain 



2.78 x 10 n J?/i 2 M Mpc" 



T viTial /K = lO^tM/lO^Mo) 2 / 3 (f c (2h 2 ) 1/3 (1 + z c ). 



(138) 



So, for Q = 1, we must solve f t t coo \ = |i7 1 [1 — (1 + z c ) 3 / 2 ] . If only recombination 
cooling was important, the solution to this would be 

(l + z c ) = (l + M/M cool ) 2 / 3 

m coo1 /m = io 13 - 1 zr7„j c 1/2 ^- 1/2 

For high metallicity, where bremsstrahlung only dominates at T > 10 8 _ftT, this equa- 
tion for z c will be a reasonable approximation up to z c ~ 10, at which point Compton 
cooling will start to operate. Given that we expect at least some enrichment rather 
early in the progress of the hierarchy, we shall keep things simple by using just the 
above expression for z c . 

We see that cooling is rapid for low masses, where the luminous and dark mass 
functions are expected to coincide. Given that cooling of massive objects is inef- 
fective, probability in the mass function must therefore accumulate at intermediate 
masses: the numbers of faint galaxies relative to bright are decreased. If M coo i M c , 
then there is a power-law region between these two masses which differs from the 

PS slope: M 2 f(M) oc M(^) + i; i.e. there is an effective change in n to n + 4. 
This may be relevant for the galaxy luminosity function, where the faint-end slope 
is close to constant numbers per magnitude. For constant mass-to light ratio, this 
implies M 2 f(M) oc M, and apparently requires n = 3. Alternatively, a spectral 
index more in accord with large-scale structure observations of n < —1 gives a 
Press-Schechter slope much steeper than the observed galaxy luminosity function. 
There have been many full and complicated studies of galaxy formation that show 
in detail why this is not really as much of a paradox as it initially seems, but the 
above simplified discussion of cooling illustrates the main way in which the naive 
analysis goes wrong. 
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5 Testing inflation against galaxy clustering 



5.1 Clustering statistics 

Fourier analysis of density fluctuations 

It is often convenient to consider building up a general field by the superposition 
of many modes. For a flat comoving geometry, the natural tool for achieving this 
is via Fourier analysis. For other models, plane waves are not a complete set and 
one should use instead the eigenfunctions of the wave equation in a curved space. 
Normally this complication is neglected: even in an open Universe, the difference 
only matters on scales of order the present-day horizon. 

How do we make a Fourier expansion of an infinite density field? If the field was 
periodic within some box of side L, then we would just have a sum over wave modes: 



F(x) = ^F k e" 



ik •> 



(140) 



Now, if we let the box become arbitrarily large, then the sum will go over to an 
integral which incorporates the density of states in /c-space - exactly as in statistical 
mechanics: The Fourier relations in n dimensions are thus 



F(x) = (J^J J F k (k) exp -ik • x d n k 
F k (k) = (J^J J F(x) expzk • x d n x. 



(141) 



One advantage of this particular Fourier convention is that the definition of convo- 
lution is just a simple volume average, with no gratuitous factors of (27r) -1 / 2 : 



f*9=^ J f(*-y)g(y)d n y. 



(142) 



Although one can make all manipulations on density fields which follow using either 
the integral or sum formulations, it is usually easier to use the sum. This saves having 
to introduce 5-functions in /c-space. For example, if we have / = f k exp(— ikx), 
the obvious way to extract f k is via f k = (1/L) j f exp(ikx) dx: because of the 
harmonic boundary conditions, all oscillatory terms in the sum integrate to zero, 
leaving only f k to be integrated from to L. There is less chance of committing 
errors of factors of 2n in this way than considering / = (L/2n) J f k exp(—ikx) dk 
and then using f exp[i(k — K)x] dx = 2Tvd D (k — K). 



Correlation functions and power spectra 

As an immediate example of the Fourier machinery in action, consider the important 
quantity 

£(r) = <<y(x)<S(x + r)), (143) 

which is the autocorrelation function of the density field - usually referred to simply 
as the correlation function. The angle brackets indicate an averaging over the 
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normalization volume V. If we express 5 as a sum, and note that reality means we 
can replace one of the two <Ts by its complex conjugate, then we obtain 



e i(k'-k)-x e -ik-r 



(144) 



An alternative way of obtaining this is to use the relation between modes with 
opposite wavevectors which holds for any real field: 5k(— k) = <5 k (k). By the periodic 
boundary conditions, however, all the cross terms with k' ^ k average to zero. 
Expressing the remaining sum as an integral, we have 



(145) 




In short, the correlation function is the Fourier transform of the power spectrum. 
We shall hereafter often use the alternative notation P(k) = \5k\ 2 - 

Now, in an isotropic universe, the density perturbation spectrum should con- 
tain no preferred direction, and so we must have an isotropic power spec- 
trum: 1 5k | 2 (k) = \5k\ 2 (k). We can therefore perform the angular integral: intro- 
duce spherical polars with the polar axis along k, and use the reality of £ so that 



ikx 



cos (kr cos 9). In three dimensions, this yields 



£(r) 



V 



(2tt) 3 

The 2D analogue of this formula is 

A 



j\6 k 



2 sin/cr 2 
kr 



£(r) = 



(2^ 



\S k \ 2 J {kr) 2nk dk. 



(146) 
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We shall usually express the power spectrum in dimensionless form, as the vari- 
ance per Ink (A 2 = da 2 /dlnk oc k 3 P[k}): 



(148) 




This gives a more easily visualisable meaning to the power spectrum than does the 
quantity VP(k), which has dimensions of volume: A 2 (k) = 1 means that there are 
order unity density fluctuations from modes in the logarithmic bin around wavenum- 
ber k. 



Power-law spectra 

The above shows that the power spectrum is a vital quantity in cosmology, but how 
can we predict its functional form? For decades, this was thought to be impossible, 
and so a minimal set of assumptions was investigated. In the absence of a physical 
theory, we should not assume that the spectrum contains any preferred length scale, 
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otherwise we should then be compelled to explain it. This means that the spectrum 
must be a featureless power law: 



\5 k \ 2 ock n 



(149) 



The index n governs the balance between large- and small-scale power. The meaning 
of different values of n can be seen by imagining the results of filtering the density 
field by passing over it a box of some characteristic size, x, and averaging the density 
over the box. This will filter out waves with k > 1/x, leaving a variance (5 2 ) oc 

Jq k n Ank 2 dk oc ;r~( n+3 ). Hence, in terms of a mass M oc x s : we have 

cWs oc M"^+ 3 )/ 6 . (150) 

Similarly, a power-law spectrum implies a power-law correlation function. If 
£(r) = (r/r ) -7 , the corresponding 3D power spectrum is 

A 2 (k) = - (At ) 7 r(2 - 7 ) sin (2 ~ l)n = (3(kr y (151) 

TV Z 

(= 0.903(/cr ) 1 ' 8 if 7 = 1.8). This expression is only valid for n < (7 < 3); for 
larger values of n, £ must become negative at large r (because P(0) must vanish, 
implying J °° £(r) r 2 dr = 0). A cutoff in the spectrum at large k is needed to obtain 
physically sensible results. 

What general constraints can we set on the value of n? Asymptotic homogeneity 
clearly requires n > —3. An upper limit of n < 4 comes from an argument due 
to Zeldovich. Suppose we begin with a totally uniform matter distribution and 
then group it into discrete chunks as uniformly as possible. It can be shown that 
conservation of momentum in this process means that we cannot create a power 
spectrum which goes to zero at small wavelengths more rapidly than 5k oc k 2 . Thus, 
discreteness of matter produces the minimal spectrum: n = 4. 

More plausible alternatives lie between these extremes. The value n = cor- 
responds to white noise: the same power at all wavelengths. This is also known 
as the Poissonian power spectrum, because it corresponds to fluctuations between 
different cells which scale as 1/ V M ce \\ (see below) . A density field created by throw- 
ing down a large number of point masses at random would therefore consist of white 
noise. Particles placed at random within cells, one per cell, create an n = 2 spec- 
trum on large scales. Practical spectra in cosmology, conversely, often have negative 
effective values of n over a large range of wavenumber. In this sense, large-scale 
structure is much more 'real' than the simple white-noise fluctuations familiar in 
other contexts. 



The Zeldovich spectrum 

Most important of all is the scale-invariant spectrum, which corresponds to the 
value n = 1, i.e. A 2 oc k 4 . To see where the name arises, consider perturbations in 
gravitational potential: 



V 2 5@ = AnGpoS =>■ 5@ k = -AnG Po 5 k /k 2 . 



(152) 
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The two powers of k pulled down by V 2 mean that, if A 2 oc k 4 for matter, then 
8% is a constant. Since potential perturbations govern the flatness of spacetime, this 
says that the scale-invariant spectrum corresponds to a metric which is fractal-like: 
it has the same degree of 'wrinkliness' on each resolution scale. The total curvature 
fluctuations diverge, but only logarithmically at either extreme of wavelength. 

Another way of looking at this spectrum is in terms of perturbation growth 
balancing the scale-dependence of 5: 5 oc x~( n+3 ^ 2 . We know that 5 viewed on 
a given co moving scale will increase with the size of the horizon: 5 oc r 2 . At an 
arbitrary time, though, the only natural length provided by the Universe (in the 
absence of non-gravitational effects) is the horizon itself: 

5(r H )ocr- (n - 1)/2 . (153) 

Thus, if n = 1, the growth of r H and 5 with time cancel out so that the universe 
always looks the same when viewed on the scale of the horizon; such a universe 
is a fractal in the sense of always appearing the same under the magnification of 
cosmological expansion. This spectrum is often known as the Zeldovich spectrum 
(sometimes hyphenated with Harrison and Peebles, who also invented it indepen- 
dently) . 



Filtering and moments 



A common concept in the manipulation of cosmological density fields is that of 
filtering: convolution of the density field with some window function: 5 — > 5 * f . 
Many observable results can be expressed in this form. Some common 3D filter 
functions are 



Gaussian : / 



V 



-r 2 /2R 2 G 



fk 



-k 2 R G /2 



(2n) 3 / 2 R G 

3V 3 
Top - hat : / = j—^ (r < R T ) =>- f k = — [sin y - y cos y] (y = kR T ) 



(154) 



Note the factor of V in the definition of /; this is needed to cancel the 1/V in the 
definition of convolution. For some power spectra, the difference in these filter func- 
tions at large k is unimportant, and we can relate them by equating the expansions 
near k = 0, where 1 — \fk\ 2 oc k 2 . This equality requires 



-R T = y/h R c 



(155) 



We are often interested not in the convolved field itself, but in its variance, for 
use as a statistic {e.g. to measure the rms fluctuations in the number of objects in 
a cell). By the convolution theorem, this means we are interested in a moment of 
the power spectrum times the squared filter transform. We shall generally use the 
following notation: 




(156) 
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the filtered variance is thus a 2 , (which we shall often denote by just a 2 ). Moments 
may also be expressed in terms of the correlation function over the sample volume: 

a 2 = - x'|) /(x) /(x') d 3 x d 3 x'. (157) 

To prove this, it is easiest to start from the definition of a 2 as an integral over the 
power spectrum times \fk\ 2 , write out the Fourier representations of P and //-, and 

use / expik ■ (x — x' + r) d s k = (27t) 3 5d (x — x' + r). Finally, it is also sometimes 
convenient to express things in terms of derivatives of the correlation function at 
zero lag. Odd derivatives vanish, but even derivatives give 

e (2n) (0) = (-l) n 2^T- (158) 



Normalization 

For scale-invariant spectra, a natural amplitude measure is the (constant) gravita- 
tional potential variance per unit Ink: 

^w^'iS)" 4 ^ (159) 

Two more commonly encountered measures relate to the clustering field around 
10 Mpc. One is as'- the rms density variation when smoothed in spheres of radius 
8/i _1 Mpc; this is observed to be very close to unity. The other is an integral over 
the correlation function: 

Js = J*Z(y) y 2 dy = J A 2 (k)W(k) * (160) 

where W(k) = (sin kr—kr cos kr)/k 3 . The canonical value of this is J3(10/i -1 Mpc) = 
277/i -3 Mpc (from the CfA survey: Davis & Peebles 1983). It is sometimes more 
usual to use instead the dimensionless volume-averaged correlation function £: 

£(x) 4ttx 2 dx = ^Js(r). (161) 

The canonical value then becomes ^(10/i _1 Mpc) = 0.83; this measure is clearly very 
close in content to as = 1. 

A point to beware of is that the normalization of a theory is often quoted in 
terms of a value of these parameters extrapolated according to linear time evolution. 
Since the observed values are clearly non-linear, there is no reason why these two 
definitions should match exactly. Even more confusingly, it is quite common in the 
literature to find the linear value of as called 1/6, where b is a bias parameter. 
The implication is that b ^ 1 means that light does not follow mass; this may well 
be true in reality, but with this definition, nonlinearities will produce b ^ 1 even in 
models where mass traces light exactly. Use of this convention is not recommended. 
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5.2 Non-linear evolution 

How is the power spectrum altered by nonlinear evolution? As with most nonlinear 
questions, this cannot be answered analytically in general, but some aspects of the 
scaling of the problem are well understood. 

A useful trick is to think of the density field under full nonlinear evolution as 
consisting of a set of collapsed, virialized clusters. What is the density profile of 
one of these objects? At least at separations smaller than the clump separation, the 
density profile of the clusters will be of the same form as the correlation function, 
since this just measures the number density of neighbours to a given galaxy. Thus, 
a power-law correlation £(r) oc r -7 may be thought of as arising from the p oc r -7 
haloes of clumps in the density field. 

In this picture, it is easy to see how £ will evolve with redshift, since clusters 
are virialized objects which do not change as the universe expands. We have stable 
clustering: £ is fixed in proper terms apart from a (1 + z)~ 3 scaling owing to the 
changing mean density of unclustered galaxies which dilute the clustering at high 
redshift. Thus, with £ oc r -7 , we obtain the comoving evolution 

f (r, z) oc (1 + z) 1 ' 3 (non - linear). (162) 

Since the observed 7 ~ 1.8, this implies slower evolution than is expected in the 
linear regime: 

f (r, z) oc (1 + z)~ 2 (linear). (163) 

This argument does not so far give a relation between the non-linear slope 7 and the 
index n of the linear spectrum. However, these two rates of evolution must match 
to give the same prediction for the evolution of ro - the length-scale of nonlinearity 
- since this is where £ will break from the linear £ oc 7- _ ( n + 3 ) to the nonlinear 
£ oc r -7 . The linear and non-linear predictions for the evolution of ro are respectively 
r oc (1 + ^)" 2 /( n + 3 ) and r oc (1 + z)-^ 3 ' 1 ^ 1 , so that 7 = (3n + 9)/(n + 5), or in 
terms of an effective index 7 = 3 + n e fj : 



(164) 



The power spectrum resulting from power-law initial conditions will evolve self- 
similarly with this index. Note the narrow range predicted: —2 < n e g < —1 for 
— 2 < n < +1, with an n = — 2 spectrum having the same shape in both linear and 
nonlinear regimes. 

Whether this evolution has been seen or not is presently controversial. Efstathiou 
et al. (1991) have observed a very low amplitude for the angular clustering of galaxies 
at B ~ 26, inferring that (if Q = 1) the clustering must evolve very rapidly - at 
about the linear-theory rate. However, their fields are very small and it is possible 
their small result is not representative. An indication that clustering may not decline 
this rapidly is given by the observed clustering of quasars at z ~ 1; Shanks & Boyle 
(1994) find the relatively high value ro = 7/i -1 Mpc. 

For many years it was thought that only the limiting cases of extreme linearity 
or nonlinearity could be dealt with analytically, but in a marvelous piece of alchemy, 
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Hamilton et al. (1991; HKLM) gave a universal analytical formula for accomplishing 
the linear <-> nonlinear mapping. The conceptual basis of their method can be under- 
stood with reference to the spherical collapse model. For Q = 1 (the only case they 
considered), a spherical clump virializes at a density contrast of order 100 when the 
linear contrast is of order unity. The trick now is to think about the density contrast 
in two distinct ways. To make a connection with the statistics of the density field, 
the correlation function £(r) may be taken as giving a typical clump profile. What 
matters for collapse is that the integrated overdensity reaches a critical value, so one 
should work with the volume-averaged correlation function £(r) . A density contrast 
of 1 + 5 can also be thought of as arising through collapse by a factor (1 + 5) 1 ^ 3 in ra- 
dius, which suggests that a given non-linear correlation £ NL (r NL ) should be thought 
of as resulting from linear correlations on a linear scale 

r L = [l + U(r NL )] 1/3 r NL . (165) 

This is one part of the HKLM procedure. The second part, having translated scales 
as above, is to conjecture that the nonlinear correlations are a universal function of 
the linear ones: 

U(r NL ) =iWUr L )]. (166) 

The asymptotics of the function can be deduced readily. For small arguments i < 1, 
/ NL (x) ~ x; the spherical collapse argument suggests / N l(1) — 10 2 . Following col- 
lapse, £ NL depends on scale factor as a 3 (stable clustering), whereas £ L oc a 2 ; the 
large- ;r limit is therefore f^(x) oc x 3 ^ 2 . HKLM deduced from numerical experi- 
ments a numerical fit that interpolated between these two regimes, in a manner 
that empirically showed negligible dependence on power spectrum. 

To use this method with power spectra, we can use the relations between £(r) 
and f (r) 

^3°-Ml ( 167 ) 
U > d[r 3 ] ' 

followed by the Fourier relations between £(r) and A 2 (k), to obtain 



f°° dk 3 

£(r) = / A 2 (k) — — — g [sin kr — kr cos kr] 
Jo k (kr) 

2k 3 f°° - 1 

A 2 (k) = / £(r) r 2 dr -——[sin kr — kr cos kr], 

3tt J (kr) 



(168) 



where the last relation holds provided that £(r) — > faster than r~ 2 at large r (i.e. 
a spectrum which asymptotically has n > — 1, a valid assumption for spectra of 
practical interest). 

However, these equations are often difficult to use stably for numerical evalua- 
tion; it is better to work directly in terms of power spectra. The key idea here is that 
£(r) can often be thought of as measuring the power at some effective wavenumber: 
it is obtained as an integral of the product of A 2 (k), which is often a rapidly rising 
function, and a window function which cuts off rapidly at k > 1/r: 
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£(r) = A 2 (k eS ) 
k eS ~ 2/r, 



(169) 



where n is the effective power-law index of the power spectrum. This approximation 
for the effective wavenumber is within 20 per cent of the exact answer over the range 
—2 < n < 0. In most circumstances, it is therefore an excellent approximation to 
use the HKLM formulae directly to scale wavenumbers and powers: 



4(M = U[A 2 M) 
K = [i + z\L(^ L )]- 1/3 ^ 



(170) 



Even better, it is not necessary that the number relating 1/r and fc e ff be a constant 
over the whole spectrum. All that matters is that the number can be treated as 
constant over the limited range r NL to r L . This means that the deviations of the 
above formulae from the exact transformation of the HKLM procedure are only 
noticeable in cases where the power spectrum deviates markedly from a smooth 
monotonic function, or where either the linear or nonlinear spectra are very flat 
(n<-2). 

What about models with Q ^ 1? The argument that leads to the f Nh (x) oc 
x 3 / 2 asymptote in the nonlinear transformation is just that linear and nonlinear 
correlations behave as a 2 and a 3 respectively following collapse. If collapse occurs 
at high redshift, then Q = 1 may be assumed at that time, and the nonlinear 
correlations still obey the a 3 scaling to low redshift. All that has changed is that 
the linear growth is suppressed by some ^-dependent factor g(f2). It then follows 
that the large-x asymptote of the nonlinear function is 

/ NL (o:)oc[^)]- 3 x 3 / 2 . (171) 

According to Carroll, Press & Turner (1992), the required growth-suppression factor 
may be approximated almost exactly by 

g(Q) = \Q m \qU 7 - tf« + (1 + tfm/2)(l + tf„/70)] , (172) 

where we have distinguished matter (m) and vacuum (y) contributions to the density 
parameter explicitly. 

Peacock & Dodds (1996) suggested the following generalization of the HKLM 
method, using the following fitting formula for the nonlinear function (strictly, the 
one which applies to the power spectrum, rather than to £): 



l + B/3x+ [Ax] a(3 ll/f3 
l + dAx^g^/iVx 1 / 2 ])^ 



(173) 



B describes a second-order deviation from linear growth; A and a parameterise 
the power-law which dominates the function in the quasilinear regime; V is the 
virialization parameter which gives the amplitude of the f Nh (x) oc x 3 / 2 asymptote; 
(3 softens the transition between these regimes. 
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Fig. 3. The generalization of the HKLM function relating nonlinear power to linear power, 
for the cases n = 0, —1 and —2, and with Q = 1 and 0.2 (dotted lines). The nonlinear 
power increases for lower Q and for more negative n, but in a nearly universal way for 
n > — 1. The fitting formula is shown for models with zero vacuum energy only, but what 
matters in general is just the J?-dependent linear growth suppression factor. 

HKLM's suggestion was that / NL might be independent of the form of the linear 
spectrum, but Jain, Mo & White (1995) showed that this is not true, especially when 
the linear spectrum is rather flat (n < —1.5). Peacock & Dodds (1996) find that an 
excellent fit (illustrated in Figure 3) is given by the following spectrum dependence 
of the expansion coefficients: 

(174) 

(175) 
(176) 
(177) 
(178) 

The more general case of curved spectra such as CDM can be dealt with very well 
by using the tangent spectral index at each linear wavenumber: 

s rftai (179) 

Note that the cosmological model does not enter anywhere in these parameters. It 
is present in the fitting formula only through the growth factor g, which governs the 
amplitude of the virialized portion of the spectrum. This says that all the quasilinear 
features of the power spectrum are independent of the cosmological model, and only 
know about the overall level of power. This is not surprising to the extent that 
quasilinear evolution is well described by the Zeldovich approximation, in which the 
final positions of particles are obtained by extrapolating their initial displacements 
by some universal time-dependent factor. 
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= 0.542 (l + n/3) - 


-0.685 


B 


= 0.097 (l + n/3)" 


-0.224 


a 


= 3.235 (l + n/3) 


-0.236 


P 


= 0.659(1 + n/3)~ 


-0.356 


V 


= 11.54 (l + n/3)" 


-0.371 
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5.3 Redshift-space effects 

Although a huge amount of astronomical effort has been invested in galaxy redshift 
surveys, with the aim of mapping the three-dimensional galaxy distribution, the 
results are not a true 3D picture. The radial axis of redshift is modified by the 
Doppler effects of peculiar velocity: 1 + z — > (1 + z)(l + v/c). Since the peculiar 
velocities arise from the clustering itself, the apparent clustering pattern in redshift 
space, where redshift is assumed to arise from ideal Hubble expansion only, differs 
systematically from that in real space. 

The distortions caused by working in redshift space are relatively simple to anal- 
yse if we assume we are dealing with a distant region of space which subtends a small 
angle, so that radial distortions can be considered as happening along one Cartesian 
axis. In this case, the apparent amplitude of any linear density disturbance is readily 
obtained from the usual linear relation between displacement and velocity (Kaiser 
1987) 



Oobs = o I H ^— 



where // is the cosine of the angle between the wavevector and the line of sight 
(p = f ■ k) . The parameter b allows for bias: the set of objects under study may be 
more clustered than the mass (5 = M maS s)- The function f(f2) — i? ' 6 is the well- 
known velocity-suppression factor due to Peebles, which is in practice a function of 
Qm only, with negligible dependence on the vacuum density (Lahav et al. 1991). 
Redshift-space effects depend on the combination 

/3 = f2°- 6 /b. (181) 

The anisotropy arises because mass flows from low-density regions onto high density 
sheets, and the apparent density contrast of the pattern is thus enhanced in redshift 
space if the sheets lie near the plane of the sky. If we average this anisotropic effect 
by integrating over a uniform distribution of the net boost to the power spectrum 
is 

i**i 2 -&w ( 1+ ^+^ 2 )- < 182 ) 

On small scales, this is not valid. The main effect here is to reduce power through 
the radial smearing due to virialized motions and the associated 'finger-of-God' 
effect. This is hard to treat exactly because of the small-scale velocity correlations. A 
simplified model was introduced by Peacock (1992) in which the small-scale velocity 
field is taken to be an incoherent Gaussian scatter with ID rms dispersion a. This 
turns out to be quite a reasonable approximation, because the observed pairwise 
velocity dispersion is a very slow function of separation, and is all the better if 
the redshift data are afflicted by significant measurement errors (which should be 
included in a). This model is just a radial convolution, and so the /c-space effect is 




fa fa exp[-/c 2 ^ 2 a 2 /2]. 



(183) 
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This effect in isolation gives an average isotropic factor of 

M 2 H4ff^ (184) 

and produces only mild damping (one power of k at large k). This last feature is 
true whatever damping function is assumed, since at large k most surviving signal 
comes from \x ~ 0. An alternative model is to use the observation that the pairwise 
distribution of velocity differences is well-fit by an exponential (probably a superpo- 
sition of Gaussians of different widths from different clumps). For this, the /c-space 
damping function is a Lorentzian: 

6 k ^6 k [l + fcVV/2]" 1 . (185) 

In either case, note that the damping depends on /j, as does the Kaiser factor: both 
are anisotropic in k space and they interfere when averaging to get the mean power. 
What happens in this case is that the linear Kaiser boost to the power is lost at 
large k, where the result is the same as for (3 = (because the main contribution at 
large k comes from small fx). 

In practice, the relevant value of a to choose is approximately 1 / \pl times the 
pairwise dispersion a v seen in galaxy redshift surveys (to this should be added in 
quadrature any errors in measured velocities). At Mpc separation, the pairwise 
dispersion is approximately 

a p ~ 300 - 400 kms" 1 (186) 

(Davis & Peebles 1983; Mo, Jing & Borner 1993; Fisher et al. 1994). We therefore 
expect wavenumbers k > 0.3 h Mpc -1 to be seriously affected by redshift-space 
smearing. 



Measuring the cosmological constant 

There is an alternative potential source of clustering anisotropy in redshift space, 
which is geometrical in nature. We have to turn positions on the sky and redshifts 
into Cartesian coordinates using the following quantities 

, / s dr c 1 . 
A(z) = R — = — 187 

and B{z) = RoSk(r). We normally assume the the Einstein-de Sitter model 

dv c \ 

V ) = flo(irF E "° w (188) 

R r{z) = 2-?-(l-^=L=)=B (z), (189) 



H \ VT+z, 

but if this is not correct, then contours of £(r) will be squashed by a factor F: 

- (190) 
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If A = 0, this distortion is small, but not for significant vacuum energy: for Q m = 0.2, 
F ~ 1.3 for z > 1. Detection of this distortion in e.g. quasar clustering would be an 
attractive means of detecting A. However, this effect interferes with the dynamical 
distortions: for a power-law spectrum, the distorted spectrum is 



P s (k^) oc k 7 



1 + fi' 



1 

J2 



- 1 



Ti — 4 

2 



X 



1 + ^ I-fT- 



1 



(191) 



D[fc^cr p ], 



where D[k/ja p ] is the redshift smearing function (Ballinger, Peacock & Heavens 
1996). To first order in / u 2 , /3 e ff ~ — 0.5n(F — 1). However, the fi 4 dependencies are 
different, so this method may still be attractive with large databases. 



Real-space clustering 

There are a number of methods available which avoid altogether the need to deal 
with the complications of redshift space. These deal with either pure two-dimensional 
clustering, as in angular correlations, or the use of projected correlations in redshift 
surveys. 

An important relation is that between the angular and spatial power spectra. In 
outline, this is derived as follows. The perturbation seen on the sky is 



<Ky) y 2 4>(y) dy, 



(192) 



where cf>(y) is the selection function, normalized such that f y 2 (f>dy — 1, and y is 
comoving distance. The form <p oc y~ 1 ^ 2 exp —(y/y*) 2 is often taken as a reasonable 
approximation to the Schechter function. A flat Universe (Q = 1) is assumed. Now 
write down the Fourier expansion of 5. The plane waves may be related to spherical 
harmonics via the expansion of a plane wave in Spherical Bessel functions ji 



Akr cos 9 



J2( 2£ + 1) i& p K cosd ) 3e(kr), 



(193) 



plus the spherical harmonic addition theorem 



An 



(194) 



where q • q' = cos#. These relations yield the desired result: 



(\aT\ 2 )=4njA 2 (k)f 



y 2 (t>{y)k{ky) dy 



(195) 



What is the analogue of this formula for small angles? Rather than manipulating 
large-£ Bessel functions, it is easier to start again from the correlation function. By 
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writing as above the overdensity observed at a particular direction on the sky as a 
radial integral over the spatial overdensity, with a weighting of y 2 (j)(y), we see that 
the angular correlation function is 

(5(qi)5(q 2 )) = J J (5(yi)5(y 2 )) ylvl^yi)^) d yi dy 2 . (196) 

We now change variables to the mean and difference of the radii, y = (y\ + y2)j2\ 
x = (yl — y2)/2. If the depth of the survey is larger than any correlation length, 
we only get signal when y\ ~ y 2 — y. If the selection function is a slowly-varying 
function, so that the thickness of the shell being observed is also of order the depth, 
the integration range on x may be taken as being infinite. For small angles, we then 
obtain Limber's equation: 



poo />oo 

w(9)= / y 4 (p 2 dy / t{^x 2 +y 2 6 2 ) dx. 

JO J -oo 



(197) 



Theory usually supplies a prediction about the linear density field in the form of the 
power spectrum, and so it is convenient to recast Limber's equation: 



w{9) 



/•OO />C 

/ yU 2 dy / 
Jo Jo 



nA 2 {k) J (ky9) dk/k 2 



(198) 



The power-spectrum version of Limber's equation is already in the form required for 
relation to the angular power spectrum (w = J AgJo(K0)dK/K), and so we obtain 
the direct small-angle relation between spatial and angular power spectra: 



*%=jcj A\K/y) y 5 <p 2 (y)dy. 



(199) 



This is just a convolution in log space, and is considerably simpler to evaluate and 
interpret than the w — £ version of Limber's equation. 

Finally, note that it is easy to make allowance for spatial curvature in the above 
discussion. All that is needed is to replace the Q = 1 volume element y 2 dy by its 
generalised counterpart, y 2 dy/(l — ky 2 ) 1 ^ 2 . 

When working with redshift surveys and treating redshift as a radial coordinate, 
the presence of peculiar velocities or redshift errors causes the correlation function 
to be convolved in the radial direction: 



/oo 
£true(r p ,r) /(7T-r) dr 
-oo 

7 poo 

= -#-/ [r 2 p + (n-x) 2 ]--/ 2 e-* 2 / 2 ^dx, 

V ZTT(J V J -oo 



(200) 



where the latter expression applies for power-law clustering and a Gaussian disper- 
sion. Treating the convolving function f(Ar) as a zero- mean scatter ignores large- 
scale streaming, which becomes more important on larger scales. However, a Fourier 
analysis is clearer in this regime. Looking at the elongation of £(r p , tt) in the redshift 
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direction allows the pairwise velocity dispersion to be estimated; it comes out at a 
relative dispersion of between 300 & 400 kms -1 for pairs of ~ 1 Mpc separation 
(Davis & Peebles 1983; Fisher et al. 1994). 

The effects of peculiar velocities may be evaded by using the correlation function 
evaluated explicitly as a 2D function of transverse (r p ) and radial (n) separation. 
The projection along the redshift axis is then independent of the velocities 

f°° f°° r dr 

w(r p ) = / £(r p , n)dn = 2 £(r) /2 , (201) 

J —oo Jr p V 1 p) 

and has the Abel integral inverse 

^=-\sy ,{ » ] w^- <2o2) 

Improved signal-to-noise in projected correlations can be obtained in the case of a 
sparse-sampled redshift survey (Kaiser 1986a), where there is a large catalogue 
of angular positions from which redshifts are measured for some fraction (either the 
brighter members, or a random subset). Saunders et al. (1992) used the angular 
cross-correlation between the l-in-6 QDOT IRAS-galaxy redshift survey and its 
parent catalogue to obtain the statistic 



/•oo />oo 

S(r) = 2 J t[{r 2 + x 2 ) 1 ' 2 ]dx = 2 J £(y) 



(203) 



2 



At first sight, it is not very attractive to use this to infer the power spectrum, 
because the window function involved is extremely broad: 

-S(r)= [ A 2 (k)^ \£-J {krj\. (204) 
r J k Vkr J 

However, useful results were obtained by Saunders et al. (1992) from a multi-stage 
process where E(r) is first deprojected to obtain £(r), which can then be integrated 
to yield £(r). This has a compact window function, and can be directly related to 
the power spectrum. 



5.4 Bias 

One of the major advances of cosmology in the 1980s was the realization that the 
distribution of galaxies need not trace the underlying density field. The main moti- 
vation for such a view may be traced to 1933 and Zwicky's measurement of the dark 
matter in the Coma cluster. A series of ever more detailed studies of cluster masses 
have confirmed his original numbers: if the Coma mass-to-light ratio is Universal, 
then the density parameter of the Universe is = 0.1 — 0.2. Those who argued that 
the value Q = 1 was more natural (a greatly increased camp after the advent of 
inflation) were therefore forced to postulate that the efficiency of galaxy formation 
was enhanced in dense environments: biased galaxy formation. This probably 
remains the strongest argument for the reality of bias. 

A weaker argument surfaced at around the same time as inflation through the 
discovery of large voids in the galaxy distribution. There was a reluctance to believe 
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that such vast regions could be truly devoid of matter - although this was at a 
time before the discovery of large-scale velocity fields. This tendency was given 
further stimulus through the work of Davis, Efstathiou, Frenk & White (1985), 
who were the first to calculate iV-body models with the CDM spectrum. Since the 
CDM spectrum curves slowly between effective indices of n = — 3 and n = 1, the 
correlation function clearly steepens with time. There is therefore a unique epoch 
when £ will have the observed slope of —1.8. Davis et al. identified this epoch as 
the present and then noted that, for Q = 1, it implied a rather low amplitude of 
fluctuations: ro = 1.3/i -2 Mpc. An independent argument for this low amplitude 
came from the size of the peculiar velocities in CDM models: if given an amplitude 
corresponding to the a% — 1 seen in the galaxy distribution, the pairwise dispersion 
was (T p ~ 1000 - 1500 kms -1 , some 3-4 times the observed value. What seemed to 
be required was a galaxy correlation function which was an amplified version of that 
for mass. This was exactly the phenomenon analysed for Abell clusters by Kaiser 
(1984), and thus was born the idea of high-peak bias: bright galaxies form only 
at the sites of high peaks in the initial density field. This was developed in some 
analytical detail by Bardeen et al. (1986), and was implemented in the simulations 
of Davis et al., leading to the conclusion that the Q = 1 h = 1/2 CDM model now 
gave a good match to observation. 

Since the mid-1980s, fashion has moved in the direction of \ow-f2 universes, 
which removes many of the original arguments for bias. However, the lesson of the 
attempts to save the Q = 1 universe is that cosmologists have learned to be wary 
of assuming that light traces mass. The assumption is that the galaxy density field 
is guilty of bias, until it is shown to be innocent. 

As was shown by Kaiser (1984), the high-peak model produces a linear ampli- 
fication of large-wavelength modes. This is likely to be a general feature of other 
models for bias, so it is useful to introduce the linear bias parameter: 
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This seems a reasonable assumption when 5p/p <C 1. Galaxy clustering on large 
scales therefore allows us to determine mass fluctuations only if we know the value 
of b. For example, the normalization in scale-invariant models may be specified by 
e (the rms potential fluctuation per In k) , but we can only measure the combination 
eb. However, the coinage of the bias parameter has been debased by its use in the 
mildly non-linear regime, where misleading definitions such as b = I /as are to be 
found. 

Even the linear relation cannot be taken for granted, however; if galaxy formation 
is not an understood process, then in principle studies of galaxy clustering may tell 
us nothing useful about the statistics of the underlying potential fluctuations against 
which we would like to test inflationary theories. It is possible to construct models 
(e.g. Bower et al. 1993) in which the large-scale modulation of the galaxy density is 
entirely non-gravitational in nature. 
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Mechanisms for bias 

Why should the galaxy distribution be biased at all? In the context of the high- 
peak model, attempts were made to argue that the first generation of objects could 
propagate disruptive signals, causing neighbours in low-density regions to be 'still- 
born'. However, it turned out to be hard to make such mechanisms operate: the 
energetics and required scale of the phenomenon are very large (Rees 1985; Dekel 
& Rees 1987). A more promising idea, known as natural bias, was introduced by 
White et al. (1987). This relied on the idea that, in an overdense region, an object of 
a given mass will collapse sooner and thus have a higher density and circular velocity. 
Application of a circular-velocity threshold then yields a bias towards high-density 
regions. White et al. argued that such an effect was to be expected owing to the 
Tully-Fisher effect: a tight correlation between luminosity and circular velocity 
for spiral galaxies. 

However, the problem with this model is that it still apparently predicts no bias 
if a strict selection by mass is performed. What is needed is some way in which star 
formation is biased (perhaps by epoch dependent efficiency) in order to produce more 
stars in the galaxies which collapse earlier. A general discussion of this problem was 
given by Cole & Kaiser (1989): suppose an object collapsing at redshift z generates 
a stellar luminosity 

L oc M a (l + z) p . (206) 

Cole & Kaiser show that a perfect Tully-Fisher relation then requires (3 = Sa/2. This 
is easily proved from the usual expression for the virial velocity dispersion resulting 
from gravitational collapse: V oc M 1 / 3 (l -\-z c ) 1 ^ 2 . The above condition removes any 
redshift dependence and leaves V oc L 1 / 30 . The conventional Tully-Fisher slope of 
1/4 then implies a = 4/3, (3 = 2. The natural bias mechanism implicitly depends on 
a strong epoch dependence of star-forming efficiency. In fact, Cole & Kaiser argue 
that a somewhat stronger epoch dependence {j3 > 3) is required to achieve sufficient 
bias to understand cluster mass-to-light ratios. However, Peacock (1990) showed 
that such a high value would predict a large scatter in the Faber- Jackson relation 
between luminosity and velocity dispersion for ellipticals. The data are much more 
closely consistent with j3 ~ 1. 

In short, there appears to be little evidence for traditional bias schemes where 
one tinkers with the efficiency of star formation. There remains the alternative that 
galaxies were born unbiased but subsequently migrated into clusters more quickly 
than dark matter. These dynamical schemes are currently attracting the most at- 
tention (West & Richstone 1988; Carlberg et al. 1990; Carlberg 1991; Couchman 
& Carlberg 1992). The idea of bias being a phenomenon largely confined to clus- 
ters fits well with the emerging picture of large-scale structure: away from clusters, 
it seems that all types of galaxies follow the same overall 'skeleton' of large-scale 
structure, independent of Hubble type (Thuan et al. 1987; Babul & Postman 1990; 
Mo, McGaugh & Bothun 1994). Even luminosity segregation is a very weak and 
controversial effect (Valls-Gabaud et al. 1989; Loveday et al. 1995). This is further 
evidence against earlier pictures in which the voids were filled with mass, but failed 
to produce bright galaxies. Increasingly, it seems that the voids really have been 
largely emptied by gravity, as implied by the large-amplitude peculiar velocities on 
these scales. It would not be surprising if the formation of all classes of galaxy was 
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suppressed in these regions of very low density, but it is implausible that the voids 
contain more than a small fraction (~ 10%) of the total mass in the universe, so the 
implied degree of bias would not be large. 

Outstanding issues 

To sum up the present position, the pictures of the galaxy distribution obtained 
with different sets of tracer objects have certain features in common (mainly the 
behaviour in low-density regions: the existence of filaments and voids), but diverge 
greatly in regions of high density. This morphological segregation is a long- 
established phenomenon (e.g. Dressier 1980); it is known that E/SO galaxies increase 
from perhaps 20% of the galaxy population in the field to almost 100% in the cores of 
rich clusters where the overdensities are ~ 10 3 . This is closely related to the relative 
proportions of optically-selected and IRAS galaxies as a function of density. At high 
overdensities, the fraction of optical galaxies which are IRAS galaxies declines by a 
factor ~ 3 - 5 from the mean (Strauss et al. 1992), reflecting the fact that IRAS 
galaxies are mainly spirals. 

We need to be able to decide on physical grounds which tracer is more likely to 
follow the mass in regions of high density, and in some cases this is quite easy. For 
example, it is well understood how and why the spatial correlations of rich clusters 
will be an amplified version of the underlying density correlations (at least for Gaus- 
sian statistics). Given this, it is a safe bet that the clustering properties of elliptical 
galaxies also over-estimate the density correlations; the known phenomenon of mor- 
phological segregation means that ellipticals are closely associated with clusters. We 
know in both the cases of clusters and ellipticals that we are excluding the low- 
density universe simply through our observational selection, so an unrepresentative 
answer would be expected. 

Much harder is the critical decision between optically-selected galaxies or IRAS 
galaxies. The former give dynamical determinations consistently in the region of 
Q = 0.2 if optical light traces mass, whereas the latter favour Q = 1, because 
IRAS galaxies peak up less strongly than optically-selected galaxies in high-density 
regions, as discussed above. So, which (if either) of these fields follows the mass? The 
problem for the committed believer in Q = 1 is that it is much more plausible that 
it is the IRAS result which is corrupted. Morphological segregation is thought to 
arise because spiral discs have trouble surviving in high-density environments, and 
it is plausible that the X-ray emitting gas in clusters is all that remains of material 
which might have made spiral discs in a less troubled habitat. We conclude that 
either Q ~ 0.2, or that some other mechanism has operated to boost the optical 
light in clusters, which is then fortuitously cancelled by the suppression of IRAS 
emission from these regions. This interpretation is made still more contrived by an 
accounting of the total baryonic material in clusters. Within the central ~ 1 Mpc, 
the masses in stars, X-ray emitting gas and total dark matter can be determined 
with reasonable accuracy (perhaps 20% rms), and allow a minimum baryon fraction 
to be determined: 




>ns 



> 0.009 + 0.050 h~ 3/2 



(207) 
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(White et al. 1993). This equation is often referred to as the baryon catastro- 
phe, for the following reasons. Assume for now that the baryon fraction in clusters 
is representative of the whole universe, and adopt the primordial nucleosynthesis 
prediction of f2 B h 2 = 0.0125. This gives an equation for Q: 



which is a limit varying between 0.21 and 0.33 for h between 0.5 and 1. This is a 
catastrophe for the Einstein-de Sitter universe, in that clusters have to be biased not 
only in the light they emit, but also in the sense of containing a larger baryon fraction 
than the average. However, producing a large-scale separation of dark matter and 
baryons on scales which are little past the turn-round phase is very difficult. If the 
density parameter is really unity, it appears that the nucleosynthesis density must 
be too low by at least a factor 3. 

Since there continue to be compelling reasons to expect Q = 1, a high priority in 
current cosmological research will continue to be to produce a convincing mechanism 
for bias, and to detect its traces in galaxy properties. However, this should not blind 
us to the fact that the simplest interpretation of the existing evidence is that the 
Universe has Q < 1.. 

5.5 Power-spectrum data 

The history of attempts to quantify galaxy clustering goes back to Hubble's proof 
that the distribution of galaxies on the sky was non-uniform. The major post-war 
landmarks were the angular analysis of the Lick catalogue, described in Peebles 
(1980), and the analysis of the CfA redshift survey (Davis & Peebles 1983). It has 
taken some time to obtain data on samples which greatly exceed these in depth, 
but several pieces of work appeared around the start of the 1990s which clarified 
many of the discrepancies between different surveys, and which paint a relatively 
consistent picture of large-scale structure (see Peacock & Dodds 1994). 

Clustering results are often published in the form of the variance (a 2 ) of 5 as a 
function of scale - using either cubical cells of side i (Efstathiou et al. 1990b) and 
Gaussian spheres of radius R G (Saunders et al. 1991). For a power-law spectrum 
(A 2 oc /c n+3 ), we have for the Gaussian sphere 



For n < 0, this formula also gives a good approximation to the case of cubical cells, 
with R G — > £/y/l2. The result is rather insensitive to assumptions about the power 
spectrum, and just says that the variance in a cell is mainly probing waves with 
A ~ 2£. Since we know the shape of A 2 reasonably well, we can get very accurate 
effective wavenumbers and plot the a 2 values on the A 2 - k plane directly using 
these /c e ff values. Such a compilation of results is shown in Figure 4. 

There is a wide range of power measured, ranging over perhaps a factor 20 
between the real-space APM galaxies and the rich Abell clusters. Are these mea- 
surements all consistent with one Gaussian power spectrum for mass fluctuations? 
The corrections for redshift-space distortions and nonlinearities can be applied to 
these data to reconstruct the linear mass fluctuations, subject to an unknown de- 
gree of bias. The reconstruction analysis has available eight datasets containing 91 



Q < 0.25 [h 1 ' 2 + 0.18/i 2 ] 
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(209) 



Inflationary Cosmology and Structure Formation 61 



Fig. 4. (a) The raw power spectrum data in the form A 2 = da 2 /dink; all data with 
the exception of the APM power spectrum are in redshift space. The two lines shown for 
reference are the transforms of the canonical real-space correlation functions for optical 
and IRAS galaxies (ro = 5 and 3.78 fo _1 Mpc and slopes of 1.8 and 1.57 respectively), 
(b) The power-spectrum data, individually linearized assuming Q = bi = 1. There is an 
excellent degree of agreement, particularly in the detection of a break around k = O.OSh. 

distinct k — A 2 pairs. The modelling has available five free parameters in the form 
of Q and the four bias parameters for Abell clusters, radio galaxies, optical galaxies 
and IRAS galaxies (6 A , b n: b Q , b^; however, only two of these really matter: Q and a 
measure of the overall level of fluctuations. For now, we take the IRAS bias param- 
eter to play this latter role. Once these two are specified, the other bias parameters 
are well determined - principally from the linear data at small k, and have the ratios 

b A : b R : b Q : b, = 4.5 : 1.9 : 1.3 : 1, (210) 

to within 6 per cent rms. 

The various reconstructions of the linear power spectrum for the case [2 = ^ = 1 
are shown superimposed in Figure 4, and display an impressive degree of agreement. 
This argues very strongly that what we measure with galaxy clustering has a direct 
relation to mass fluctuations, rather than the large-scale clustering pattern being 
an optical illusion caused by non-uniform galaxy-formation efficiency (Bower et al. 
1993). If this were the case, the shape of spectrum inferred from clusters should 
have a very different shape at large scales, contrary to observation. 

Large-scale power-spectrum data and models 

It is interesting to ask if the power spectrum contains any features, or whether 
it is consistent with a single smooth curve. A convenient description is in terms 
of the CDM power spectrum, which is A 2 (k) oc k n+3 T 2 . We shall use the BBKS 
approximation for the transfer function: 
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Fig. 5. The linearized power-spectrum data of Figure 4, averaged over bins of width 0.1 in 
log 10 k, compared to various CDM models. These assume scale-invariant initial conditions, 
with the same large-wavelength normalization. Different values of the fitting parameter 
Qh = 0.5, 0.45, . . . 0.25, 0.2 are shown. The best-fit model has Oh = 0.25 and a normal- 
ization of a 8 (IRAS) = 0.75. 

_ ln(l + 2.34 9 ) 

k 2.34g (211) 

[1 + 3.89q + (16.1q) 2 + (5.46q) 3 + (6.71g) 4 ]" 1/4 , 

where q = k/[Qh 2 Mpc -1 ]. Since observable wavenumbers are in units of /iMpc -1 , 
the shape parameter is the apparent value of Qh. This scaling applies for models 
with zero baryon content, but there is an empirical scaling (Sugiyama 1995) that 
can account for this: 

T fc (fc) = T BBKS (k/[Qh 2 exp -(Q B + Q B /Q)\). (212) 

The symbol r* is used to refer to Qh in the BBKS spectrum as an empirical fitting 
parameter, on the understanding that it would mean the above combination if CDM 
models were taken literally. Fitting this spectrum to the large-scale linearised data 
of Figure 5 requires the parameters 

r* ~ 0.25 + 0.3(l/n- 1), (213) 

<t 8 (IRAS) ~ 0.75, (214) 

in agreement with many previous arguments suggesting that an apparently low- 
density model is needed. For any reasonable values of h and baryon density, a high- 
density CDM model is not viable. Even a high degree of 'tilt' in the primordial 
spectrum (Cen et al. 1992) does not help change this conclusion unless n is set so low 
that major difficulties result when attempting to account for microwave-background 
anisotropies. 
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The fit of this model is illustrated in Figure 5, which makes it clear that the 
problem with CDM is the shape of the power spectrum, rather than the absolute 
amount of power at large scales. The linear transfer function does not bend sharply 
enough at the break wavenumber if a high-density r* = 0.5 model is adopted. 
An important general lesson can also be drawn from the lack of large-amplitude 
features in the power spectrum. This is a strong indication that collisionless matter 
is deeply implicated in forming large-scale structure. Purely baryonic models contain 
large bumps in the power spectrum around the Jeans' length prior to recombination 
{k ~ 0.03 Qh 2 Mpc -1 ), whether the initial conditions are isocurvature or adiabatic 
{e.g. Section 25 of Peebles 1993). It is hard to see how such features can be reconciled 
with the data, beyond a 'visibility' of perhaps 20 - 30%. 

Small-scale clustering data 

It should clearly be possible to reach stronger conclusions by using the data at 
larger k. However, here the assumption of linear bias is clearly very weak, and 
what is needed is a model for the scale dependence of the bias. Mann, Peacock & 
Heavens (1996) argue that an empirical approach can be taken here, even lacking the 
physics of bias. We know that, if Q = 1, the density of light in high-density regions 
must receive an additional enhancement. There is also the possibility that galaxy 
formation may be suppressed in voids. General arguments (Coles 1993) indicate 
that such local bias should produce a systematic steepening of the correlation 
function, so that the effective bias is larger on small scales. Experiments with such 
local density-field modifications on numerical datasets suggests that the effect on 
the power spectrum is of a steepening which can be roughly approximated by 

1 + A 2 {k)^[l + b 1 A 2 ] b \ (215) 

where [6162] x l 2 would be the bias parameter in the linear regime. Such an expression 
can fit bias schemes from high-peak bias as in Davis et al. (1985) or the 'physical 
bias' seen in full hydro dynamical simulations {e.g. Cen & Ostriker 1992). Generally 
the steepening is not so extreme, and it is hard to find models where 62 exceeds b±. 

Irrespective of a priori considerations, such an expression accounts well for the 
difference in real-space clustering of APM and IRAS galaxies, and can map one 
onto the other with almost uncanny precision (see Figure 6) , with a relative bias of 
61 = 1.2, b 2 = 1.1 or ^apm/^iras — 1-15. Of particular interest is the inflection in the 
spectrum around k ~ 0.1 /iMpc -1 , which seems likely to be real, since it is seen in 
two rather different datasets, which probe different regions of space. 

Attempting to fit the small-scale clustering data now complicates the picture 
from large scales, since the nonlinear extrapolation of the r* = 0.25 model is not 
consistent with the small-scale clustering. Figure 7 compares three different models 
with the data: Einstein-de Sitter and open and flat J? = 0.3 models. There is a 
tension between the data and all of these models, in that it seems impossible to 
fit both large scales and small scales simultaneously. Without bias, the correct am- 
plitude of small-scale clustering requires as ^ 0.7; this greatly under-predicts the 
clustering at k ~ 0.1 /iMpc -1 unless r* < 0.1 is adopted, in which case the power 
at k ~ 0.02 /iMpc -1 is greatly exceeded. The Q = 1 model needs a lower nor- 
malization, and so does not exceed the small-scale data, but it suffers from related 
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problems of shape. Any hypothetical bias which would scale the mass spectrum to 
that of light would need to be non-monotonic, with a smaller effect at k ~ 2 h Mpc -1 
than on larger scales. 

What then is required of a linear power spectrum that would fit the data? 
None of the CDM-like alternatives considered above explain the inflection at 
k ~ 0.1 /iMpc -1 , and it is unlikely to be produced by any bias scheme, since these 
always tend to give a smooth scale dependence. The general conclusion is therefore 
that there must be a relatively sharp break in the linear spectrum around this point. 

A further general point which emerges from this plot is that it is something of 
a puzzle that the clustering data continue as an unbroken n ~ — 1 power law up 
to A 2 ~ 10 3 . As the regime of virialized clustering is reached, a break to a flatter 
slope would be expected; only the open models fail to show this feature, which is a 
robust prediction for any smooth linear spectrum. What is needed is a small linear 
growth-suppression factor g(Q), with Q m < 0.5 for open models or < 0.1 for flat 
models This is far from being a conclusive argument for open models, but it does 
require a coincidence from the bias mechanism: that the galaxy correlations should 
be steepened just where the mass correlations are saturating. 

We can implement these ideas with a simple empirical model, which works ex- 
tremely well. Consider a spectrum in the form of a break between two power laws: 



(fc/fcp 

1 + (k/k 

As shown in Figure 8, this matches the data very nicely, if we choose the parameters 

k = 0.3 /iMpc" 1 
h = 0.05 /iMpc" 1 

a = 0.8 

(3 = 4.0. 

A value of (3 = 4 corresponds to a scale-invariant spectrum at large wavelengths, 
whereas the effective small-scale index is n = —2.2. The linear spectrum is not 
required to be non-zero for k > 1 /iMpc -1 , and so a variety of other possibilities 
would be made to work, including those with short-wavelength cutoffs. 



CMB anisotropics 

A consistent model must match the normalization of the mass fluctuations on large 
scales inferred from fluctuations in the Cosmic Microwave Background. In making 
this comparison, it is important to be clear that the CMB fluctuations depend only 
on the very large-scale P oc k n portion of the spectrum. Predictions of smaller-scale 
fluctuations such as the amplitude as then require additional information in the 
form of the shape parameter r* . Rather than quoting the as implied by the CMB, 
it is therefore clearer to give the large-scale normalization separately, with as then 
depending on the choice of r* . 

Bunn, Scott & White (1995) and White & Bunn (1995) discuss the large-scale 
normalization from the 2-year COBE data in the context of CDM-like models. The 
final 4-year COBE data favour slightly lower results, and we scale to these in what 
follows. For scale-invariant spectra and Q = 1, the best normalization is 
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Fig. 6. (a) The real-space power spectra of APM and IRAS galaxies, as deduced by Baugh 
& Efstathiou (1993; 1994) and Saunders et al. 1992. The APM data have been boosted by 
a factor 1.2 because clustering evolution was not allowed for in the Baugh & Efstathiou 
data, (b) The same with a two-parameter scale-dependent boost to the IRAS data. The 
agreement is outstanding, except for the three largest-scale IRAS points, and these can be 
seen to be too high from a comparison with the redshift-space results. 



Fig. 7. The clustering data for optical galaxies, compared to three models with r* = 0.25: 
(a) Q = 1, as = 0.5; (b) Q m = 0.3, fi v = as = 1. (c) Q m = 0.3, Q v = 0.7 as = 1. Linear 
spectra are shown dotted; evolved nonlinear spectra are solid lines. All of these models are 
chosen with a normalization which is approximately correct for the rich-cluster abundance 
and large-scale peculiar velocities. In all cases, the shape of the spectrum is wrong. The 
high-density model would require a bias which is not a monotonic function of scale, whereas 
the low-density models exceed the observed small-scale clustering. 
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Fig. 8. An empirical double power-law model for the power spectrum provides an extemely 
good fit to the optical-galaxy power spectrum but requires an open universe and a sharp 
break in the spectrum to a rather flat (n < —2) high-A; behaviour. 

A 2 (k) = (k/OmzrhMpc' 1 ) 4 , (218) 

equivalent to <5 rms = 18.0 juK, or e = 3.07 x 10~ 5 in the notation of Peacock (1991), 
with an rms error in density fluctuation of 8%. 

For low-density models, a naive analysis as in PD suggests that the power spec- 
trum should depend on Q and the growth factor g as P oc g 2 /Q 2 . Because of time 
dependence of gravitational potential (integrated Sachs- Wolfe effect) and spatial 
curvature, this expression is not exact, although it captures the main effect. From 
the data of White & Bunn (1995), a better approximation is 

A 2 (k)K^g - 7 . (219) 

This applies for low- J? models both with and without vacuum energy, with a max- 
imum error of 2% in density fluctuation provided Q > 0.2 (and gives the same as 
values as Gorski et al. (1995), when the appropriate r* corrections are made, to 
within 3%). Since the rough power-law dependence of g is g ~ J? - 65 and i? 23 
for open and flat models respectively, we see that the implied density fluctuation 
amplitude scales approximately as i? -012 and J? -0 - 69 for these two cases. The de- 
pendence is very weak for open models, but vacuum energy implies very much larger 
fluctuations. These results are illustrated for CDM spectra in Figure 9, which shows 
08 as a function of r* for three models. For r* = 0.25, open low-density models 
are close to the required as = 1, whereas flat models have an amplitude perhaps a 
factor 2 too high. Einstein-de Sitter models have a 8 = 0.65, which is only slightly 
high. 

What if we tilt the spectrum? For tilt, one evaluates as as in the above no-tilt 
case, and then scales as 
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Fig. 9. The clustering normalization as as a function of r* , predicted from COBE assuming 
scale-invariant primordial fluctuations. Note that flat low-density models require a much 
larger normalization than do open models. 

o-g oc exp[2.3(n- 1)]; (220) 

since a% measures the power spectrum at k ~ 0.2 /iMpc" 1 , this corresponds to say- 
ing that COBE determines the spatial power spectrum at an effective wavenumber 
of about 0.002 /iMpc -1 . If gravity waves are included with the usual inflationary 
coupling between wave amplitude and tilt, the effect increases to 

a 8 oc exp[4.3(n- 1)]. (221) 

The flat models can then have as reduced by the required factor of 2, but a sub- 
stantial degree of tilt is needed (n ~ 0.70 or 0.84, the latter figure including gravity 
waves). These are significantly larger degrees of tilt than would be expected from 
at least the simplest inflationary models. 

In any case, the CDM model is, as we have seen, in some difficulty as a general 
description of the spectrum. A more robust datum is probably the power on the 
largest reliable scales: 

A 2 opt (k = 0.02 h Mpc" 1 ) ~ 0.005, (222) 

which is to be compared to a COBE scale-invariant prediction of 0.0054. Scaling 
as J?" - 24 or J?" 1 - 38 boosts this by a factor of 1.5 (open Q = 0.2) or 9.2 (flat 
J? = 0.2). The former factor is within the plausible effect of a transfer function, 
but the latter is not. The required tilt to remove the additional factor 6 in power 
is gross: n = 0.2, pivoting the spectrum about k = 0.002 h Mpc -1 . Allowing for 
gravity waves improves this to n = 0.7, but the conclusion remains that low-density 
flat models require an extremely large degree of tilt in order to be viable. 
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What does it all mean? 

What then is the interpretation of the spectrum? A CDM spectrum with r* ~ 0.25 
is not consistent with Q = 1 and any plausible estimate for h. However, even the 
low-T* spectra are probably of the wrong shape, so it is not clear if one can argue 
from the best- fitting r* for Q < 1. 

Interesting alternatives with high density are either mixed dark matter (Holtz- 
man 1989; Klypin et al. 1993), or non-Gaussian pictures such as cosmic strings + 
HDM, where the lack of a detailed prediction for the power spectrum helps ensure 
that the model is not yet excluded (Albrecht & Stebbins 1992). Isocurvature CDM 
is also attractive in that it gives a rather sharper bend at the break scale, as the 
data seem to require. However, this model conflicts strongly with limits on CMB 
anisotropies, and cannot be correct. (Efstathiou & Bond 1986). Mixed dark mat- 
ter seems rather ad hoc, but may be less so if it is possible to produce both hot 
and cold components from a single particle, with a Bose condensate playing the 
role of the cold component (Madsen 1992; Kaiser, Malaney & Starkman 1993). The 
main problems with an MDM model are ones generic to any model with a very flat 
high-/c spectrum in a high-,!? universe: difficulty in forming high-redshift objects and 
difficulty in achieving a steep correlation function on small scales. 

Alternatively, if the good fit of a low-density CDM transfer function is taken 
literally, then perhaps this is a hint that the epoch of matter-radiation equality 
needs to be delayed. An approximate doubling of the number of relativistic degrees 
of freedom would suffice - but this would do undesirable violence to primordial 
nucleosynthesis: any such boost would have to be provided by a particle which 
decays after nucleosynthesis. The apparent value of fih depends on the mass and 
lifetime of the particle roughly as 

^apparent = ^ [1 + (ml y T years ) 2 / 3 ] (223) 

(Bardeen, Bond & Efstathiou 1987; Bond & Efstathiou 1991), so a range of masses is 
possible. Apart from making the observed large-scale structure, such a model yields 
a small-scale enhancement of power which could lead to early galaxy formation 
(McNally & Peacock 1995). 

Of course, the simplest alternative is to admit that the above attempts to save 
the Einstein-de Sitter model are too contrived, and that Q m is < 1. This would 
make a low-.T* model easier to understand, but it introduces few new possible ways 
to alter the shape of the linear power spectrum. There are of course the oscillatory 
features expected in the power spectra of baryon-dominated universes {e.g. Section 
25 of Peebles 1993), but these occur at too small k. A possibility would be warm 
dark matter with a cutoff at k ~ 0.2 /iMpc -1 , but it may well be that something 
entirely new is needed. 

Apart from lessening the large-scale structure problems, low densities make life 
easier in two other ways: the universe is made older for a given h and strong bias 
need not be accounted for. The main difficulty which will need to be overcome in 
order for the reality of a low-density universe to be accepted is to understand the 
values for Q which have emerged from attempts to us large-scale peculiar velocities 
to 'weigh' the universe {e.g. Dekel 1994. The amplitude of such motions should not 
be a problem, as they should scale with 5v oc f2°' 6 as, and the normalization inferred 
from nonlinear systems is as oc J? -0,5 . However, there is a problem in the degree 
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of bias inferred from comparing such velocities with the density field. In principle, 
this allows one to determine (3 = i? 6 /6, and values of (3 ~ 1 have been inferred 
for IRAS galaxies. If Q ~ 0.2, this would require b ~ 0.5. However, Loveday et al. 
(1996) have found the much lower value (3 ~ 0.5 from redshift-space distortions in 
the Stromlo-APM redshift survey. If these lower values are confirmed, the strength 
of the case for low Q and little bias would become overwhelming. 

This leaves unresolved the distinction between an open model and one in which 
a significant vacuum energy keeps the inflationary k = prediction (Efstathiou et 
al. 1990b). The general case Q v ^ and k ^ is also a logical possibility, but 
not an appealing one. As constraints on h, galactic ages, and details of the CMB 
measurements improve, there is the happy possibility that a clear discrimination 
between these alternatives may soon be reached. 



5.6 Gaussianity 
Skewness 

How are we to distinguish observationally whether the density field of the Universe is 
Gaussian? We need to look at more subtle statistics than just the power spectrum. In 
principle, one might use the higher-order n-point correlation functions, since these 
are directly related to the power spectrum for a Gaussian field. On small scales, 
nonlinear evolution must produce non-Gaussian behaviour. For any a, a Gaussian 
density distribution will produce a tail to unphysical 5 < — 1 values. The lognormal 
model 5 — > exp[5 — cr 2 /2] is the simplest analytical modification which cures this 
problem (Coles & Jones 1991). This distribution is skew, with a tail towards large 
values of 5. The question is whether the density field contains a greater degree of 
non-Gaussianity than that induced by gravitational evolution. 

Non- Gaussian behaviour may be measured through the skewness parameter (not 
the skewness itself): 

(224) 

which can be calculated through second-order gravitational perturbation theory, and 
should be a constant of order unity. Gaztahaga (1992) showed that the skewness 
parameter for the APM galaxy survey was approximately constant with scale, at 
the value expected for nonlinear gravitational evolution of a Gaussian field. Does 
this mean (a) that conditions are Gaussian; (b) that 6=1? This is possible, but the 
effects of bias need to be understood first. As a simple example, consider a power- 
law modification of a lognormal field: p' oc p b . Since this is generated by a Gaussian 
field, it is easy to find the skewness parameter, which is 

S = 2 + e b2a \ (225) 

In the linear regime, the skewness is independent of b and so this sort of model 
would not violate the observation that the moments (5 3 ) and (5 2 ) are in the correct 
ratio for straightforward gravitational evolution without bias. 
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Topology 

An interesting alternative probe of Gaussianity was suggested by Gott, Melott & 
Dickinson (1986): the topology of the density field. To visualise the main principles, 
it will help to think initially about a 2D field. Two extreme non-Gaussian fields 
would consist either of discrete 'hotspots' surrounded by uniform density, or the 
opposite: discrete 'coldspots'; the picture in either case is a set of polka dots. Both 
of these cases are clearly non-Gaussian just by symmetry: the contours of average 
density will be simply-connected circles containing regions which are all either above 
or below the mean density, but in a Gaussian field (or any symmetric case), the 
numbers of hotspots and coldspots must balance. 

One might think that things would be much the same in 3D: the obvious alter- 
natives are 'meatball' or 'Swiss cheese' models. However, there is a third topological 
possibility: that of the sponge. In our two previous examples, high- and low-density 
regions were distinguished by their connectivity (whether it is possible to move 
continuously between all points in a given set). In contrast, a sponge has both classes 
of region being connected: it is possible to swim to any point through the holes, or 
to burrow to any point within the body of the sponge; filling a sponge with cement 
and etching away the sponge produces a cement sponge. Again, just by the sym- 
metry between overdensity and underdensity, a Gaussian field in 3D must have a 
sponge-like topology. 

The above discussion has focused on the properties of contour surfaces. These 
properties can be studied quantitatively via the genus: the number of 'holes' in a 
surface (zero for a sphere, one for a doughnut etc.). This is related to the Gaussian 
curvature of the surface, K = l/{r\r2) (where r± and ri are the two principal radii 
of curvature), via the Gauss-Bonnet theorem (see e.g. Dodson & Poston 1977) 

C = J KdA = 4tt(1 - G), (226) 

where G is the genus. Topological results are sometimes instead quoted in terms of 
the Euler-Poincare characteristic, which is —2 times the genus. 

For Gaussian fields, the expectation value of the genus per unit volume (denoted 
by g) is (see Hamilton, Gott & Weinberg 1986) 



(227) 



For the median density contour (v = 0), the curvature is negative, implying that the 
surface has genus greater than unity. For \u\ > 1, however, the curvature is positive 
- as expected if there are no holes. The contours become simply connected balls 
around either isolated peaks or voids. 

It is interesting to note that the genus carries some information about the shape 
of the power spectrum, not in the behaviour with z/, but in the overall scaling. For 
Gaussian filtering, 




1 /3 + n 



3/2 
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and so the effective spectral index can be determined in this way. 

A similar procedure can be carried out in 2D (see Melott et al. 1989; Coles & 
Plionis 1991). The Gauss-Bonnet theorem is now 



C = J K dA = 2Tr(l-G), 



(229) 



where the meaning of 1 — G is the number of isolated contours minus the number 
of contour loops within other loops (sometimes the 2D genus is defined with the 
opposite sign; our convention follows that in 3D and the signs below are consistent). 
The result for the 2D genus per unit area is 



(230) 




The 3D case is analogous but, as always, more messy; see BBKS. 



Fig. 10. Results from the Genus analysis applied to 3D redshift data, taken from Gott et al. 
(1989). The small 'meatball shift' seen here is argued by the authors to be consistent with 
non-linear evolution from Gaussian initial conditions. It is interesting that the behaviour 
becomes more nearly Gaussian as we move to deeper samples which allow larger filtering 
lengths. Such plots constitute the strongest evidence we have that cosmic structure did 
indeed form via gravitational instability from Gaussian primordial fluctuations. 

Applications of this method to real data (Figure 10) naturally reveal departures 
from Gaussian behaviour - one wishes to test whether the initial conditions were 
Gaussian, realising that nonlinear evolution will cause the field to become non- 
Gaussian. This means that either A-body simulations have to be used to predict 
the degree of non-Gaussian behaviour (usually in the 'meatball' direction), or one 
is confined to smoothing the data heavily to probe only large linear/angular scales. 
These should still be Gaussian, but of course by smoothing over many small regions 
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there is the danger that the central limit theorem will produce a Gaussian-like result 
in all cases. 



Fourier tests 

A third class of test was suggested by Feldman, Kaiser & Peacock (1994), and rests 
on measuring phase correlations between different mode amplitudes in the Fourier 
analysis of redshift surveys. First note that the two-point function in k space for a 
homogeneous random statistical process is always a delta-function: 

(5 k (k)5i(k')} = ^P(fc) 5 D (k - k'), (231) 

and that this applies for both Gaussian and non-Gaussian processes. To prove this, 
write down the definition of 5 k twice and multiply for different wavenumbers, using 
the reality of 5: 

S k S* k , = ^ J 6(t)6(t + x) e* k '' x (fx J e ir ^~^ d 3 r. (232) 

Performing the ensemble average for a stationary statistical process gives 

<*(r)<5(r + x)> =£(*), (233) 

independent of r. The integral over r can now be performed, showing that 
(5 k (k)5 k (k')) vanishes unless k = k' in the discrete case, or that in the continuum 
limit there is a delta-function in k space. 

This result applies in the limit of an infinite survey. When there is a limited 
survey volume, delimited by the mean density n(r), we know that the Fourier coef- 
ficients are convolved by the transform of n(r). There will therefore be a coherence 
length in k space of order the reciprocal of the survey depth, over which length 
Fourier modes will have a significant two-point correlation. In the generalization 
where the survey galaxies may be weighted, Feldman, Kaiser & Peacock show that 
the exact expression for the two-point function is 



where 



{5 k (k)5* k (k + 5k)) = P(k)Q(5k) + S{5k), (234) 
/ w 2 n 2 expfzk ■ r] 



Q(k) = 
S(k) 



f w 2 n 2 d 3 r 

r 2- r-v }A3 (235) 
J w n exp|zk • rj a r 

J w 2 n 2 d 3 r 



Furthermore, in the case of Gaussian fields only, this two-point function of ampli- 
tudes is simply related to the two-point function for the power: 

(5P(k)5P(k + 5k)) = \(5 k (k)5t(k + 5k))\ 2 . (236) 



The significance of these results is that they allow direct constraints to be placed 
on a large class of non-Gaussian models in which the character of the linear density 
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fluctuations is Gaussian, but with a spatial modulation or intermittency, so that 
there are 'quiet' and 'noisy' parts of space: 



Such a model was proposed on phenomenological grounds by Peebles (1983), but 
might also be realised in inflationary models with multiple scalar fields. The modu- 
lating field M(r) acts in the same way as a mask imposed by observational selection, 
multiplying the effective n. It therefore convolves the transform of n and broadens 
it. The signature of this form of non-Gaussianity is thus an extended tail of corre- 
lated power in the transform of the survey, and the agreement of the observed and 
expected power correlations can be used to set limits on the non-Gaussianity. Figure 
11 shows the application of this analysis to the combined QDOT and 1.2-Jy surveys, 
and the excellent agreement provides a strong piece of evidence for the Gaussian 
nature of primordial fluctuations. 



Fig. 11. The normalized 2-point correlation of the power measured in the combined QDOT 
and 1.2-Jy IRAS redshift surveys, plotted against k separation 8k/ /iMpc -1 (Stirling & 
Peacock 1996). Wavenumbers k < 0.1 /iMpc -1 are considered. The observed correlations 
follow the expected form very closely, and limit any modulation of the statistical properties 
of the density field on 100 Mpc scales. 



(J(r)-xJ(r) [1 + M(r)]. 



(237) 
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6 Conclusions 

The testability of inflation 

This brief summary of inflationary models has presented the 'party line' of many 
workers in the field for the simplest ways in which inflation could happen. It is a 
tremendous achievement to have a picture of this level of detail - but is it at all 
close to the truth? 

What are the predictions of inflation? The simplified package is (1) k = 0; 
(2) scale-invariant, Gaussian fluctuations, and these are essentially the only tests 
discussed in the 1990 NATO symposium on observational test of inflation (Shanks 
et al. 1991). As the observations have hardened, however, there has been a tendency 
for the predictions to weaken. The flatness prediction was long taken to favour an 
Q = 1 Einstein-de Sitter model, but timescale problems have moved attention to 
models with f? ma tter + Vacuum = 1- More recently, inflationary models have even 
been proposed which might yield open universes. This is achieved not by fine-tuning 
the amount of inflation, which would certainly be contrived, but by appealing to 
the details of the mechanism whereby inflation ends. It has been proposed that 
quantum tunnelling might create a 'bubble' of open universe in a plausible way 
(Bucher, Goldhaber & Turok 1995; Gorski et al. 1995). It seems that inflation can 
never be disproved by the values of the global cosmo logical parameters. 

A more characteristic inflationary prediction is the gravity-wave background. 
Although it is not unavoidable, the coupling between tilt and the gravity-wave con- 
tribution to CMB anisotropies is a signature expected in many of the simplest 
realizations of inflation. An observation of such a coupling would certainly consti- 
tute very powerful evidence that something like inflation did occur. Sadly, this test 
loses its power as the degree of tilt becomes smaller, which does seem to be the 
case observationally (Peacock & Dodds 1994). The most convincing verification of 
inflation would of course be the direct detection of the predicted flat spectrum of 
gravity waves in the local universe. However, barring some clever new technique, 
this will remain technologically unfeasible for the immediate future. 

It therefore seems likely that the debate over the truth of inflation will continue 
without a clean resolution. There are certainly points of internal consistency in 
the theory which will attract further work: the form of the potential, and whether 
it can be maintained in the face of quantum corrections. Also, there is one more 
fundamental difficulty, which has been evaded until now. 

The A problem 

All our discussion of inflation has implicitly assumes that the zero of energy is set 
at A = now, but there is no known principle of physics which requires this to be 
so. By experiment, f2 vac < 1, which corresponds to a density in the region of 10 100 
times smaller than the GUT value. This fine tuning represents one of the major 
unsolved problems in physics (Weinberg 1989). 

Other ways of looking at the origin of the vacuum energy include thinking of 
it as arising from a Bose-Einstein condensate, or via contributions from virtual 
particles. In the latter case, the energy density would be the rest mass times the 
number density of particles. A guess at this is to set the separation of particles at 
the Compton wavelength, h/mc, yielding a density 
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This exceeds the cosmological limit unless m < 10 _8 m e . One proposal for evading 
such a nonsensical result was made by Zeldovich: perhaps we cannot observe the 
rest mass of virtual particles directly, and we should only count the contribution of 
their gravitational interaction. This gives instead 



which is acceptable if m < 100m e - still far short of any plausible GUT-scale or 
Planck- scale cutoff. 

We are left with the strong impression that some vital physical principle is miss- 
ing. It is perhaps just as well that the average taxpayer who funds research in physics 
does not realise how much difficulty we have in understanding even nothing at all! 

Fortunately, the existence of a non-zero vacuum density is not entirely a philo- 
sophical conundrum, but is subject to empirical verification in cosmology. If A exists 
at above the level of a few tenths of the critical density, it can be detected by a com- 
bination of geometrical tests, its effect on cosmological ages, and detailed signatures 
in CMB anisotropies. The challenge of confirming or ruling out a cosmologically 
significant vacuum energy is therefore developing into one of the dominant themes 
of cosmology in the 1990s, and is an area where we can reasonably hope to reach a 
decision within the next few years. 
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